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Nu cleic acid molecules specific for bacterial 
antigens and uses thereof . 



TECHNICAL FIELD 

The invention relates to novel nucleotide sequences 
located in a gene cluster which controls the synthesis of 
a bacterial polysaccharide antigen, especially an O 
antigen, and the use of those nucleotide sequences for the 
detection of bacteria which express particular 
polysaccharide antigens (particularly O antigens) and for 
the identification of the polysaccharide antigens 
(particularly O antigens) of those bacteria. 

BACKGROUND ART 

Enteropathogenic EL. coli strains are well known 
causes of diarrhoea and haemorrhagic colitis in humans and 
can lead to potentially life threatening sequelae 
including haemo lytic uremic syndrome and thrombotic 
throrabocytopaenic purpura. Some of these strains are 
0 commonly found in livestock and infection in humans is 

usually a consequence of consumption of contaminated meat 
or dairy products which have been improperly processed. 
The O specific polysaccharide component (the "O antigen") 
of lipopolysaccharide is known to be a major virulence 
5 factor of enteropathogenic IL coli strains. 

The EL. coli O antigen is highly polymorphic and 166 
different forms of the antigen have been defined; Ewing, 
W. H. [in Edwards and Ewings "Identification of the 
Enterobacteriacea" Elsevier. Amsterdam (1986)] discusses 
0 12 8 different O antigens while Lior H. (1994) extends the 
number to 166 [in "Classification of Escherichia, coli In 
Escherichia coli in domestic animals and humans pp31-72. 
Edited by C.L. Gyles CAB International]. The genus 
Salmonella enterica has 46 known O antigen types [Popoff 
5 M.Y. et al (1992) " Antigenic formulas of the Salmonella 
enterica serovars" 6th revision WHO Collaborating Centre 
for Reference and Research on Salmonella enterica , Institut 
Pasteur Paris France] . 
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An important step in determining the biosynthesis of 
O antigens -and therefore the mechanism of the polymorphism 
has been to characterise the gene clusters controlling O 
antigen biosynthesis. The genes specific for the 
synthesis of the O antigen are generally located in a gene 
cluster at map position 45 minutes oh the chromosome of E . 
coli K-12 [Bachmann, B. J. 1990 "Linkage map of 
Escherichia coli K-12". Microbiol. Rev. 54: 130-197], and 
at the corresponding position in S-s. enterica LT2 
[Sanderson et al (1995) "Genetic map of Salmonella 
enterica typhimurium" , Edition VTII Microbiol. Rev. 59: 
241-3 03] . in both cases the O antigen gene cluster is 
close to the gnd gene as is the case in other strains of 
L. coli and S^ enterica [Reeves P.R. (1994) "Biosynthesis 
and assemby of lipopoly saccharide, 281-314. in A. 
Neuberger and L.D.M. van Deenen (eds) "Bacterial cell 
wall, new comprehensive biochemistry " vol 27 Elsevier 
Science Publishers] . Thesfe greries ^encode enzymes for the 
synthesis of nucleotide diphosphate sugars and for 
0 assembly of the sugars into oligosaccharide units and in 
general for polymerisation to O antigen. 

The coli O antigen gene clusters for a wide range 
of EL_ coli O antigens have been cloned but the 07, 09, 016 
and Olll O antigens have k^ra , studied in more detail with 
5 only 09 and Ol6 having been fully characterised with 

regard to nucleotide sequence to date [Kido N. , Torgov 
V.I., Sugiyama T. , Uchiya K* , Sugihara H., Komatsu T., 
Kato N. & Jann K. (1S|95) "Egression of the 09 
polysaccharide of Escherichia, coli: sequencing of the E. 
0 coli 09 rfb gene cluster, characterisation of mannosyl 
transferases, and evidence for an ATP-binding cassette 
transport system" *J. of Bacteriol. 177 2178-2187; 
Stevenson G., Neal B. , Liu p. J Hobbs ML, Packer N.H., 
Bat ley M. , Redmond iJ.W., Lindguist L. & Reeves PR (1994) 
5 "Structure of the O antigen of E. coli K12 and the 

sequence of its rfb gene cluster" J. of Bacteriol. 176 
4144-4156; Jayaratne, P. et al. (1991) "Cloning and 
analysis of duplicated rfbM and rfbK genes involved in the 
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formation of GDP-mannose in Escherichia coli O9:K30 and 
participation of rfb genes in the synthesis of" "the group 1 
K30 capsular polysaccharide" J. Bacterid . 176: 3126-3139; 
Valvano, M. A. and Crosa, J. H.(1989)" Molecular cloning 
and expression in Escherichia coli K-12 of chromosomal 
genes determining the 07 lipopolysaccharide antigen of a 
human invasive strain of E.coli 07:K1". Inf and Immun. 
57:937-943; Marolda C. L. And Valvano, M. A. (1993). 
"Identification, expression, and DNA sequence of the GDP- 
mannose biosynthesis genes encoded by the 07 rfb gene 
cluster of strain VW187 (Eschericia coli 07:iOJ". j. 
Bacterid. 175:148-158. ] 

Bastin D.A. , et al . 1991 [^Molecular cloning and 
expression in Escherichia coli K-12 of the rfb gene 
15 cluster determining the O antigen of an E.coli 0111 

strain". Mol . Microbiol. 5:9 2223-2231] and Bastin D.A. 
and Reeves, P.R. [(1995)" Sequence and analysis of the O 
antigen gene (rfb) cluster of Escherichia coli 0111" . Gene 
164: 17-23] isolated chromosomal DNA encoding the EL_ coli 
20 0111 rfb region and characterised a 6962 bp fragment of E. 
coli 0111 rfb. Six open reading frames (orfs) were 
identified in the 6962 bp partial fragment and the 
alignment of the sequences of these orfs revealed homology 
with genes of the GDP-mannose pathway, rfbK and rfbM, and 
25 other rfb and cps genes. 

The nucleotide sequences of the loci which control 
expression of Salmone lla enterica B, A, Dl, D2 , D3 , CI, C2 
and E O antigens have been characterised [Brown, P. K. , L. 
K. Romana and P. R. Reeves (1991) "Cloning of the rfb gene 
cluster of a group C2 Salmonella enterica ; comparison with 
the rfb regions of groups B and D Mol. Microbiol. 5:1873- 
1881; Jiang, X.-M., B. Neal, F. Santiago, S. J. Lee, L. K. 
Romana, and P. R. Reeves (1991) ^Structure and sequence 
of the rfb (0 antigen) gene cluster of Salmonella enterica 
35 serovar typhimurium { LT2 ) * . Mol. Microbiol. 5:692-713; 
Lee, S. J., l. K. Romana, and P. R. Reeves (1992) 
^Sequences and structural analysis of the rfb (O 
antigen) gene cluster from a group CI Salmonella enterica 
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enterica strain" J. Gen. Microbiol. 138: 1843-1855; Lui , 
D . , N . K. Verma, L . K. Romana, and P. R. Reeves (1991) 
"Relationship among the rfb regions of Salmonella enterica 
serovars A, B and D" J. Bacteriol. 173: 4814-4819; Verma, 
N. K. , and P. Reeves (1989) * Identification and sequence 
of rfbS and rf±>E, which determine the antigenic 
specificity of group A and group D Salmonella enterica e* 
J. Bacteriol. 171: 5694-5701; Wang, L., L. K. Romana, and 
P. R. Reeves (1992) "Molecular analysis of a Salmonella 
enterica enterica group El rfb gene cluster: 0 antigen and 
the genetic basis of the major polymorphism" Genetics 
130: 429-443; Wyk, P., and P. Reeves (1989). 
"Identif ication and sequence of the gene for abequose 
synthase, which confers antigenic specificity on group B 
Salmonella enterica e : homology with galactose epimerase" 
J. Bacteriol. 171: 5687-5693,; Xiang, S. H. , M. Hobbs , and 
P. R. Reeves. 1994 Molecular analysis of the rfb gene 
luster of a group D2 Salmonella enterica strain: evidence 
for its origin from an insertion sequence -mediated 
recombination event between group E and Dl strains. J. 
Bacteriol. 176: 4357 -4365; Curd, H., D. Liu and P. R. 
Reeves, 1998. Relationships among the O antigen Salmonella 
enterica groups B, Dl, D2 , and D3 . J. Bacteriol. 180: 
1002-1007. ] . 

Of the closely related Shigella (which really can be 
considered to be part of E_t_ coli ) S . dvsenteriae and S. 
f lexneri 0 antigens have been fully sequenced and are next 
to gnd. [Klena JD & Schnaitman CA (1993) "Function of the 
rfb gene cluster and the rfe gene in the synthesis of O 
antigen by Shigella, dysenteriae 1" Mol. Microbiol. 9 393- 
402; Morona R. , Mavris M. , Fallarino A. & Manning P. 
(1994) "Characterisation of the rfc region of Shigella, 
f lexneri" J. Bacteriol 176: 733-747] 

Inasmuch as the O antigen of enteropathogenic JL=_ coli 
strains and the O antigen of Salmonella enterica strains 
are major virulence factors and are highly polymorphic, 
there is a real need to develop highly specific, 
sensitive, rapid and inexpensive diagnostic assays to 
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detect coli and assays to detect enterica . There is 
also a rea-l need to develop diagnostic assays to identify 
the O antigens of L coli strains and assays to identify 
the 0 antigens of enterica strains. With regard to the 
5 detection of IL_ coli these needs extend beyond EHEC 

( enteropa thogenic haemorrhagic EL. coli) strains but this 
is the area of greatest need. There is interest in 
diagnostics for ETEC (enterotoxigenic E^ coli) etc in E. 
coli . 

10 The first diagnostic systems employed in this field 

used large panels of antisera raised against E^ coli O 
antigen expressing strains or enterica O antigen 
expressing strains. This technology has inherent 
difficulties associated with the preparation, storage and 
15 usage of the reagents, as well as the time required to 
achieve a meaningful diagnostic result. 

Nucleotide sequences derived from the O antigen gene 
clusters of enterica strains have been used to 
determine enterica O antigens in a PCR assay [Luk, 
20 J.M.C. efc al. (1993) "Selective amplification of abequose 
and paratose synthase genes {rfb) by polymerase chain 
reaction for identification of enterica major serogoups 
(A, B, C2, and D)", J. Clin. Microbiol. 31:2118-2123 ]. 
The prior complete nucleotide sequence characterisation of 
25 the entire rfb locus of serovars Typhimurium, Paratyphi A, 
Typhi, Muenchen, and Ana turn; representing groups B, A, Dl, 
C2 and El respectively enabled Luk et al. to select 
oligonucleotide primers specific for those serogroups. 
Thus the approach of Luk et al . was based on aligning 
30 known nucleotide sequences corresponding to CDP-abequose 
and CDP-paratose synthesis genes within the 0 antigen 
regions of £. enterica serogroups El, Dl, A, B and C2 and 
exploiting the observed nucleotide sequence differences in 
order to identify serotype-specif ic oligonucleotides. 
35 In an attempt to determine the O antigen serotype of 

a Shiga-like toxin producing EL_ coli strain, Paton, A. W. , 
et al. 1996 ["Molecular microbiological investigation of 
an outbreak of Hemolytic-Uremic Syndrome caused by dry 
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fermented sausage contaminated with Shiga-like toxin 
producing -Escherichia coli" . J. Clin. Microbiol . 34: 1622- 
1627], used oligonucleotides derived from the wbdl (ojrf6) 
region, which were believed to be specific to the coli 
Olll antigen and which were derived from EL. coli 0111 
sequence, in a PCR diagnostic assay. Unpublished reports 
indicate that the approach of Paton et al . is deficient in 
that the nucleotide sequences derived from wbdl may not 
specifically identify the 0111 antigen and in fact lead to 
detection of false positive results. Paton et al . 
disclose the detection of 5 Olll antigen isolates by PCR 
when in fact from only 3 of those isolates did they detect 
bacteria which reacted with Olll specific antiserum. 

DESCRIPTION OF THE INVENTION 

Whilst not wanting to be held to a particular 
hypothesis , the present inventors now believe that the 
reported false positives found with the Paton et al . 
method are due to the fact that the nucleic acid molecules 
0 employed by Paton et al . were derived from genes which 

have a putative function as a sugar pathway gene, [Bastin 
D.A. and Reeves, P.R. (1995) Sequence and analysis of the 
O antigen gene (rfb) cluster of Escherichia coli Olll. Gene 
164: 17-23] which they now believe to lack the necessary 
5 nucleotide sequence specificity to identify the E^ coli O 
antigen. The inventors now believe that many of the 
nucleic acid molecules derived from sugar pathway genes 
expressed in jjL*. enterica or other enterobacteria are also 
likely to lack the necessary nucleotide sequence 
0 specificity to identify specific O antigens or specific 
serotypes . 

In this regard it is important to note that the genes 
for the synthesis of a polysaccharide antigen include 
those related to the synthesis of the sugars present in 
5 the antigen (sugar pathway genes) and those related to the 
manipulation of those sugars to form the polysaccharide. 
The present invention is predominantly concerned with the 
latter group of genes , particularly the assembly and 
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transport genes such as transferase, polymerase and 
flippase genes. 

The present inventors have surprisingly found that 
the use of nucleic acid molecules derived from particular 
assembly and transport genes, particularly transferase, 
wzx and wzy genes, within O antigen gene clusters can 
improve the specificity of the detection and 
identification of O antigens. The present inventors 
believe that the invention is not necessarily limited to 
the detectipn of the particular O antigens which are 
encoded by the nucleic acid molecules exemplified herein, 
but has broad application for the detection of bacteria 
which express an O antigen and the identification of O 
antigens in general. Further because of the similarities 
between the gene clusters involved in the synthesis of O 
antigens and other polymorphic polysaccharide antigens, 
such as bacterial capsular antigens, the inventors believe 
that the methods and molecules of the present invention 
are also applicable to these other polysaccharide 
antigens . 

Accordingly, in one aspect the present invention 
relates to the identification of nucleic acid molecules 
which are useful for the detection and identification of 
specific bacterial polysaccharide antigens . 

The invention provides a nucleic acid molecule 
derived from: a gene encoding a transferase; or a gene 
encoding an enzyme for the transport or processing of a 
polysaccharide or oligosaccharide unit, including a wzx 
gene, wzy gene, or a gene with a similar function; the 
gene being involved in the synthesis of a particular 
bacterial polysaccharide antigen, 

wherein the sequence of the nucleic acid molecule is 
specific to the particular bacterial polysaccharide 
antigen . 

Polysaccharide antigens, such as capsular antigens of 
IL. coli (Type I and Type II) , the Virulence capsule of S . 
enterica sv Typhi and the capsules of species such as 
Streptococcus pneumoniae and Staphylococcus albus are 
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encoded by genes which include nucleotide sugar pathway- 
genes, sugar transferase genes and genes for the transport 
and processing of the polysaccharide or oligosaccharide 
unit. In some cases these are wzx or wzy but in other 
5 cases they are quite different because a different 
processing pathway is used- Examples of other gene 
clusters include the gene clusters for an extracellular 
polysaccharide of Streptococcus thermoohilus . an 
exopolysaccharide of Rhizobium melilotti and the K2 

10 capsule of Klebsiella pneumoniae . These all have genes 
which by experimental analysis, comparison of nucleotide 
sequence or predicted protein structure, can be seen to 
include nucleotide sugar pathway genes, sugar transferase 
genes and genes for oligosaccharide or polysaccharide 

15 processing . 

In the case of the coli K-12 colanic acid capsule 
gene cluster [Stevenson et al (1996) "Organization of the 
Escherichia coli K-12 gene cluster responsible for 
production of the extracellular polysaccharide colanic 

20 acid". J. Bacteriol 178: 4885-4893] genes from the three 
classes were identified either provisionally or 
definitively. Colanic acid capsule is classified with the 
Type I capsule of coli . 

The present inventors believe that, in general, 

25 transferase genes and genes for oligosaccharide processing 
will be more specific for a given capsule than the genes 
coding for the nucleotide sugar synthetic pathways as most 
sugars present in such capsules occur in the capsules of 
different serotypes. Thus the nucleotide sugar synthesis 

3 0 pathway genes could now be predicted to be common to more 
than one capsule type . 

As elaborated below the present inventors recognise 
that there may be polysaccharide antigen gene clusters 
which share transferase genes and/or genes for 

3 5 oligosaccharide or polysaccharide processing so that 

completely random selection of nucleotide sequences from 
within these genes may still lead to cross-reaction; an 
example with respect to capsular antigens is provided by 
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the JE^_ coli type II capsules for which only transferase 
genes are -sufficiently specific. However, the present 
inventors in light of their current results nonetheless 
consider the transferase genes or genes controlling 
5 oligosaccharide or polysaccharide processing to be 

superior targets for nucleotide sequence selection for the 
specific detection and characterisation of polysaccharide 
antigen types. Thus where there is similarity between 
particular genes, selection of nucleotide sequences from 

10 within other transferase genes or genes for 

oligosaccharide or polysaccharide processing from within 
the relevant gene cluster will still provide specificity, 
or alternatively the use of combinations of nucleotide 
sequences will p^ide^fehe desired specificity. The 

15 combinations of nucleotide sequences may include 

nucleotide sequences i^ir ived from pathway genes together 
with nucleotide* -sequ&^ from transferase, wzx or 

wzy genes . 

Thus the invention ^isb provides a panel of nucleic 

20 acid molecules wherein! 1 th^ iiucleic acid molecules are 

derived from a combination of genes encoding transferases 
and/or enzymes for the transport or processing of a 
polysaccharide or oligosaccharide unit including wzx or 
wzy genes; where in t1&£ iB^S^in^ t i on of genes is specific to 

25 the synthesis of polysaccharide 

antigen and wherein tli£ ^p^e'l of nucleic acid molecules is 
specific to a bacterial* polysaccharide antigen. In 
another preferred f6rm, ttte nucleic acid molecules are 
derived from a combination of genes encoding transferases 

30 and/or enzymes for the 1 tfis^sii>ort or processing of a 

polysaccharide or oligosacctiaride unit including wzx or 
wzy genes, together with nucleic acid molecules derived 
from pathway genes. ^; ^ 

In a sitebri|& ; "present invention relates to 

35 the identiflcd&i^ acid molecules which are 

useful for the :: &Mil£6£±o^.oit bacteria which express O 
antigens and for the identification of the O antigens of 
those bacteria in diagnostic assays . 
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The invention provides a nucleic acid molecule 
derived from: a gene encoding a transferase; or a gene 
encoding an enzyme for the transport or processing of a 
polysaccharide or oligosaccharide unit such as a wzx or 
wzy gene, the gene being involved in the synthesis of a 
particular bacterial O antigen, wherein the sequence of 
the nucleic acid molecule is specific to the particular 
bacterial 0 antigen. 

The nucleic acids of the invention may be variable in 
length. In one embodiment they are from about 10 to about 
20 nucleotides in length. 

In one preferred embodiment, the invention provides a 
nucleic acid molecule derived from: a gene encoding a 
transferase; or a gene encoding an enzyme for the 
transport or processing of a polysaccharide or 
oligosaccharide unit including a wzx ox wzy gene the gene 
being involved in the synthesis of an O antigen expressed 
by J2*. coli . wherein the sequence of the nucleic acid 
molecule is specific to the O antigen. 
0 In one more preferred embodiment, the sequence of the 

nucleic acid molecule is specific to the nucleotide 
sequence encoding the 0111 antigen (SEQ ID N0:1) . More 
preferably, the sequence is derived from a gene selected 
from the group consisting of wbdH (nucleotide position 739 
5 to 1932 of SEQ ID NO:l), wzx (nucleotide position 8646 to 
9911 of SEQ ID NO:l), wzy (nucleotide position 9901 to 
10953 of SEQ ID N0:1), wbdM (nucleotide position 11821 to 
12945 of SEQ ID NO:l) and fragments of those molecules of 
at least 10-12 nucleotides in length. Particularly 
0 preferred nucleic acid molecules are those set out in 

Table 5 and 5A, with respect to the above mentioned genes . 

In another more preferred embodiment, the sequence of 
the nucleic acid molecule is specific to the nucleotide 
sequence encoding the 0157 antigen (SEQ ID N0:2). More 
5 preferably the sequence is derived from a gene selected 

from the group consisting of wbdN (nucleotide position 7 9 
to 861 of SEQ ID N0:2), wbdO, (nucleotide position 2011 to 
2757 c f. SEQ ID NO : 2 ) , wbdP (nucleotide position 5257 to 
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6471 of SEQ ID NO:2)), wbdR (13156 to 13821 of SEQ ID 
NO:2), wzx- (nucleotide position 2744 to 4135 of" SEQ ID 



NO:2) and wzy (nucleotide position 858 to 2042 of SEQ ID 
NO:2). Particularly preferred nucleic acid molecules are 
those set out in Table 6 and 6A. 

The invention also provides in a further preferred 
embodiment a nucleic acid molecule derived from: a gene 
encoding a transferase; or a gene encoding an enzyme for 
the transport or processing of a polysaccharide or 
oligosaccharide unit including a wzx or wzy gene; the gene 
being involved in the synthesis of an O antigen expressed 
by Salmonella enterica , wherein the sequence of the 
nucleic acid molecule is specific to the O antigen. 

In one more preferred form of this embodiment, the 
sequence of the nucleic acid molecule is specific to the 
nucleotide sequence encoding the S^_ enterica C2 antigen 
(SEQ ID NO: 3) . More preferably the sequence of the 
nucleic acid molecule is derived from a gene selected from 
the group consisting of wbaR (nucleotide position 2352 to 
0 3314 of SEQ ID NO:3), wba.L (nucleotide position 3361 to 
3875 of SEQ ID NO:3), wbaQ (nucleotide position 3977 to 
5020 of SEQ ID NO:3), wbaW (nucleotide position 6313 to 
7323 of SEQ ID 1*0:3), wbaZ (nucleotide position 7310 to 
8467 of SEQ ID NO:3), wzx (nucleotide position 1019 to 
5 2359 of SEQ ID NO: 3) and wzy (nucleotide position 5114 to 

6313 of SEQ ID NO: 3) . Particularly preferred nucleic acid 
molecules are those set out in Table 7 . 

In another more preferred form of this embodiment, 
the sequence of the nucleic acid molecule is specific to 
0 the nucleotide sequence encoding the enterica B antigen 
(SEQ ID NO: 4) . More preferably the sequence is derived 
from wzx (nucleotide position 12762 to 14054 of SEQ ID 
NO: 4) or wbaV (nucleotide position 14059 to 15060 of SEQ 
ID NO: 4) . Particularly preferred nucleic acid molecules 
5 are those set out in Table 8 which are derived from wzx 
and wbaV genes ♦ 

In a further more preferred form of this embodiment, 
the sequence of the nucleic acid molecule is specific to 
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the enterica D3 O antigen and is derived from the wzy 
gene. 

In yet a further preferred form of this embodiment, 
the sequence of the nucleic acid molecule is specific to 
the enter jpa El O antigen and is derived from the wzx 
gene . 

While transferase genes, or genes coding for the 
transport or processing of a polysaccharide or 
oligosaccharide unit, such as a wzx or wzy gene, are 
superior targets for specific detection of individual O 
antigen types there may well be individual genes or parts 
of them within this group that can be demonstrated to be 
the same or closely related between different O antigen 
types such that; crQss^re^c^ipns can occur. Cross 
reactions should be avoided by the selection of a 
different target; within. the , group or the use of multiple 
targets withi^..,^,,^^, . .^:, f 

Further, i,t, r is. r^cpgn^s^ that there are cases where 
O antigen gene qlusters ha^e arisen from recombination of 
at least two strains such that the unique O antigen type 
is provided by a combination of gene products shared with 
at least two other O antigen types. The recognised 
example of this phenomenon JLs the ^ enterica O antigen 
serotype D2 which, has genes .fr^om Dl and El but none unique 
to D2. m these qir^xu^l^^ies the detection of the O 
antigen type ca# s till be , achieved in accordance with the 
invention, but requires the use of a combination of 
nucleic acid molecules .to. detect a specific combination of 
genes that exists only in t^at particular O antigen gene 
cluster. 

Thus , the invention also, provides a panel of nucleic 
acid molecules wherein, the ; nucleic acid molecules are 
derived from gen.es e^cb^ing, /transferases and/or enzymes 
for the transport ; ,.^px, ; sp^r|^e^^ng ,p£ , a polysaccharide or 
oligosaccharide ; ^it ; inclu^ng wzx or wzy genes, wherein 
the panel of nucleic acid molecules is specific to a 
bacterial O antigen. Preferably the particular bacterial 
O antigen is expressed by S^. enterica . More preferably. 
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the panel of nucleic acid molecules is specific to the D2 
O antigen and is derived -from the El wzy gene and the Dl 
wzx gene . 

The combinations of nucleotide sequences may include 
nucleotide sequences derived from pathway genes, together 
with nucleotide sequences derived from transferase, wzx or 
wzy genes. 

Thus, the invention also provides a panel of nucleic 
acid molecules, wherein the nucleic acid molecules are 
derived from genes encoding transferases and/or enzymes 
for the transport or processing of a polysaccharide or 
oligosaccharide unit including wzx or wzy genes, and sugar 
pathway genes, wherein the panel of nucleic acid molecules 
is specific to a particular bacterial O antigen. 
Preferably the O antigen is expressed enter ica . 

Further it is recognised that there may be instances 
where spurious hybridisation will arise through initial 
selection of a sequence found in many different genes but 
this is typically recognisable by, for instance, 
0 comparison of band sizes against controls in PCR gels, and 
an alternative sequence can be selected. 

The present inventors believe that based on the 
teachings of the present invention and available 
information concerning polysaccharide antigen gene 
5 clusters (including O antigen gene clusters) , and through 
use of experimental analysis, comparison of nucleic acid 
sequences or predicted protein structures, nucleic acid 
molecules in accordance with the invention can be readily 
derived for any particular polysaccharide antigen of 
>0 interest. Suitable bacterial strains can typically be 
acquired commercially from depositary institutions. 

As mentioned above there are currently 166 defined 
coli O antigens while the £L*. enterica has 46 known O 
antigen types [Popoff M.Y. et al (1992) "Antigenic 
J 5 formulas of the Salmonella serovars" 6th revision WHO 
Collaborating centre for Reference and Research on 
Salmonella, Institut Pasteur Paris France] . Many other 
genera of bacteria are known to have O antigens and these 
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inc lude Citrobacter , Shigella , Yersinia , Plesiomonas , 
Vibrio and: Proteus. 

Samples of the 166 different E. coli 0 antigen 
serotypes are available from Statens Serum Institut, 
Copenhagen , Denmark . 

The 46 enterica serotypes are available from 
Institute of Medical and Veterinary Science, Adelaide, 
Australia . 

In another aspect, the invention relates to a method 
of testing a sample for the presence of one or more 
bacterial polysaccharide antigens comprising contacting 
the sample with at least one oligonucleotide molecule 
capable of specifically hybridising to: (i) a gene 
encoding a transferase, or (ii) a gene encoding an enzyme 
for transport or processing of oligosaccharide or 
polysaccharide units, including a wzx or wzy gene; wherein 
said gene is involved in the synthesis of the bacterial 
polysaccharide antigen; under conditions suitable to 
permit the at least one oligonucleotide molecule to 
0 specifically hybridise to at least one such gene of any 
bacteria expressing the particular bacterial 
polysaccharide antigen present in the sample and detecting 
any specifically hybridised oligonucleotide molecules. 

Where a single specific oligonucleotide molecule is 
5 unavailable a combination of molecules hybridising 

specifically to the target region may be used. Thus the 
invention provides a panel of nucleic acid molecules for 
use in the method of testing of the invention, wherein the 
nucleic acid molecules are derived from genes encoding 
0 transferases and/or enzymes for the transport or 

processing of a polysaccharide or oligosaccharide unit 
including wzx or wzy genes, wherein the panel of nucleic 
acid molecules is specific to a particular bacterial 
polysaccharide. The panel of nucleic acid molecules can 
5 include nucleic acid molecules derived from sugar pathway 
genes where necessary. 

In another aspect, the invention relates to a method 
of testing a sample for the presence of one or more 
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bacterial polysaccharide antigens comprising contacting 
the sample -with at least one pair of oligonucleotide 
molecules, with at least one oligonucleotide molecule of 
the pair capable of specifically hybridising to: (i) a 
gene encoding a transferase, or (ii) a gene encoding an 
enzyme for transport or processing oligosaccharide or 
polysaccharide units, including a wzx or wzy gene; wherein 
said gene is involved in the synthesis of the bacterial 
polysaccharide antigen; under conditions suitable to 
permit the at least one oligonucleotide molecule of the 
pair of molecules to specifically hybridise to at least 
one such gene of any bacteria expressing the particular 
bacterial polysaccharide antigen present in the sample and 
detecting any specifically hybridised oligonucleotide 
molecules . 

The pair of oligonucleotide molecules may both 
hybridise to the same gene or to different genes. Only 
one oligonucleotide molecule of the pair need hybridise 
specifically to sequence specific for the particular 
0 antigen type. The other molecule can hybridise to a non- 
specific region. 

Where the particular polysaccharide antigen gene 
cluster has arisen through recombination, the at least one 
pair of oligonucleotide molecules may be selected to be 
5 capable of hybridising to a specific combination of genes 
in the cluster specific to that polysaccharide antigen, or 
multiple pairs may be selected to provide hybridisation to 
the specific combination of genes. Even where all the 
genes in a particular cluster are unique, the method may 
0 be carried out using nucleotide molecules which recognise 
a combination of genes within the cluster. 

Thus the invention provides a panel containing pairs 
of nucleic acid molecules for use in the method of testing 
of the invention, wherein the pairs of nucleic acid 
5 molecules are derived from genes encoding transferases 
and/or enzymes for the transport or processing of a 
polysaccharide or oligosaccharide unit including wzx or 
wzy genes, wherein the panel of nucleic acid molecules is 
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specific to a particular bacterial polysaccharide antigen. 
The panel of nucleic acid* molecules can include pairs of 
nucleic acid molecules derived from sugar pathway genes 
where necessary. 

In another aspect, the invention relates to a method 
of testing a sample for the presence of one or more 
particular bacterial O antigens comprising contacting the 
sample with at least one oligonucleotide molecule capable 
of specifically hybridising to: (i) a gene encoding an O 
antigen transferase, or (ii) a gene encoding an enzyme for 
transport or processing of the oligosaccharide or 
polysaccharide unit, including a wzx or wzy gene; wherein 
said gene is involved in the synthesis of the particular O 
antigen; under conditions suitable to permit the at least 
one oligonucleotide molecule to specifically hybridise to 
at least one such gene of any bacteria expressing the 
particular bacterial O antigen present in the sample and 
detecting any specifically hybridised oligonucleotide 
molecules. Preferably the bacteria are EL. coli or S . 
enterica . More preferably, the coli express the 0157 
serotype or the 0111 serotype. More preferably the S . 
enterica express the C2 or B serotype. Preferably, the 
method is a Southern blot method. More preferably, the 
nucleic acid molecule is labelled and hybridisation of the 
nucleic acid molecule is detected by autoradiography or 
detection of fluorescence. 

The inventors envisage circumstances where a single 
specific oligonucleotide molecule is unavailable . In 
these circumstances a combination of molecules hybridising 
specifically to the target region may be used. Thus the 
invention provides a panel of nucleic acid molecules for 
use in the method of testing of the invention, wherein the 
nucleic acid molecules are derived from genes encoding 
transferases and/or enzymes for the transport or 
processing of a polysaccharide or oligosaccharide unit 
including wzx or wzy genes, wherein the panel of nucleic 
acid molecules is specific to a particular bacterial 0 
antigen. Preferably the particular bacterial O antigen is 
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expressed by £L_ enterica . The panel of nucleic acid 
molecules can include nucleic acid molecules derived from 
sugar pathway genes where necessary. 

In another aspect, the invention relates to a method 
of testing a sample for the presence of one or more- 
particular bacterial O antigens comprising contacting the 
sample with at least one pair of oligonucleotide molecules 
with at least one oligonucleotide molecule of the pair 
being capable of specifically hybridising to: (i) a gene 
encoding an O antigen transferase, or (ii) a gene encoding 
an enzyme for transport or processing of the 
oligosaccharide or polysaccharide unit, including a wzx or 
wzy gene; wherein said gene is involved in the synthesis 
of the particular O antigen; under conditions suitable to 
permit the at least one oligonucleotide molecule to 
specifically hybridise to at least one such gene of any 
bacteria expressing the particular bacterial O antigen 
present in the sample and detecting any specifically 
hybridised oligonucleotide molecules. 

Preferably the bacteria are JL_ coli or iL_ enterica . 
More preferably, the E_-_ coli are of the 0111 or the 0157 
serotype. More preferably the £L. enterica express the C2 
or B serotype. Preferably, the method is a polymerase 
chain reaction method. More preferably the oligonucleotide 
5 molecules for use in the method of the invention are 
labelled. Even more preferably the hybridised 
oligonucleotide molecules are detected by electrophoresis. 
Preferred oligonucleotides for use with 0111 which provide 
for specific detection of 0111 are illustrated in Table 5 
0 and 5A with respect to the genes wbdH, wzx, wzy and wbdM. 
Preferred oligonucleotide molecules for use with 0157 
which provide for specific detection of 0157 are 
illustrated in Table 6 and 6A. 

With respect to serotypes C2 and B, suitable 
5 oligonucleotide molecules can be selected from appropriate 
regions described in column 3 of Tables 7 and 8 . 

The inventors envisage rare circumstances whereby two 
genetically similar gene clusters encoding serologically 
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different O antigens have arisen through recombination of 
genes or mutation so as to generate polymorphic variants . 
In these circumstances multiple pairs of oligonucleotides 
may be selected to provide hybridisation to the specific 
5 combination of genes . The invention thus provides a panel 
containing pairs of nucleic acid molecules for use in the 
method of testing of the invention, wherein the pairs of 
nucleic acid molecules are derived from genes encoding 
transferases and/or enzymes for the transport or 
10 processing of a polysaccharide or oligosaccharide unit 

including wzx or wzy genes, wherein the panel of nucleic 
acid molecules is specific to a particular bacterial O 
antigen. Preferably the particular bacterial O antigen is 
expressed by enterica . The panel of. nucleic acid 
15 molecules can include pairs of nucleic acid molecules 
derived from sugar pathway genes where necessary. 

In another aspect, the invention relates to a method 
for testing a food derived sample for the presence of one 
or more particular bacterial O antigens comprising 
20 contacting the sample with at least one pair of 

oligonucleotide molecules with at least one oligonucleotide 
molecule of the pair being capable of specifically 
hybridising to: (i) a gene encoding an O antigen 
transferase, or (ii) a gene encoding an enzyme for 
25 transport or processing of the oligosaccharide or 

polysaccharide unit, including a wzx or wzy gene; wherein 
the gene is involved in the synthesis of the particular O 
antigen; under conditions suitable to permit the at least 
one oligonucleotide molecule to specifically hybridise to 
30 at least one such gene of any bacteria expressing the 

particular bacterial polysaccharide antigen present in the 
sample and detecting any specifically hybridised 
oligonucleotide molecules . Preferably the bacteria are 3EL_ 
coli or iLu enterica . More preferably, the EL. coli are of 
35 the 0111 or 0157 serotype. More preferably the S. 

enterica are of the C2 or B serotype. Preferably, the 
method is a polymerase chain reaction method. More 
preferably the oligonucleotide molecules for use in the 
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method of the invention are labelled. Even more 
preferably the hybridised oligonucleotide molecules are 
detected by electrophoresis . 

In another aspect the present invention relates to a 
method for testing a faebal derived sample for the presence 
of one or more particular bacterial O antigens comprising 
contacting the sample witti &t least one pair of 
oligonucleotide molecules with at least one oligonucleotide 
molecule of the pair bfeing capable of specifically 
hybridising to: (i) a gene encoding an O antigen 
transferase, or (ii) a gene encoding an enzyme for 
transport or processing of the oligosaccharide or 
polysaccharide unit, iricit^dirig 'a wzx or wzy gene; wherein 
said gene is involved iri tiie synthesis of the particular O 
antigen; under conditions suitable to permit the at least 
one oligonucleotide molecule to specifically hybridise to 
at least one of * said ' a*iy bacteria expressing the 

particular bacteria:! i editigeri present in the sample and 
detecting any specifically hybridised oligonucleotide 
0 molecules. Preferably ttie bacteria are £L_ coli or 

enterica . More preferably, the EL. coli are of the 0111 or 
0157 serotype. More preferably, the iL*. enterica are of 
the C2 or B serotype . ';V':p^£^ably , the method is a 
polymerase chain res^ti^P^t^d- Mdre preferably the 
5 oligonucleotide molecules for use in the method of the 
invention are label leid. Eir4n more preferably the 
hybridised oligonucleotide triblecules are detected by 
electrophoresis. ' 



0 method for testing a sample derived from a patient for the 



In another aspect, the present invention relates to a 



presence of one or more particular bacterial O antigens 
comprising contacting tlie sample with at least one pair of 
oligonucleotide molectaf^ kt least one oligonucleotide 

molecule of the capde : of specifically 

hybridising to: (ij^ a ^coding an O antigen 

transferase, or (ii) a gene encoding an enzyme for 
transport or processing of the oligosaccharide or 
polysaccharide unit, including a wzx or wzy gene; wherein 
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said gene is involved in the synthesis of the particular O 
antigen; under conditions suitable to permit the at least 
one oligonucleotide molecule to specifically hybridise to 
at least one such gene of any bacteria expressing the 
particular bacterial O antigen present in the sample and 
detecting any specifically hybridised oligonucleotide 
molecules. Preferably the bacteria are EL. coli or S . 
enterica . More preferably, the EL. coli are of the 0111 or 
0157 serotype- More preferably, the £L_ enterica are of 
the C2 or B serotype. Preferably, the method is a 
polymerase chain reaction method. More preferably the 
oligonucleotide molecules for use in the method of the 
invention are labelled. Even more preferably the 
hybridised oligonucleotide molecules are detected by 
electrophoresis . 

In the above described methods it will be understood 
that where pairs of oligonucleotides are used one of the 
oligonucleotide sequences may hybridise to a sequence that 
is not from a transferase, wzx or wzy gene. Further where 
0 both hybridise to one of these gene products they may 

hybridise to the same or a different one of these genes. 

In addition it will be understood that where cross 
reactivity is an issue a combination of oligonucleotides 
may be chosen to detect a combination of genes to provide 
5 specificity. 

The invention further relates to a diagnostic kit 
which can be used for the detection of bacteria which 
express bacterial polysaccharide antigens and the 
identification of the bacterial polysaccharide type of 
0 those bacteria. 

Thus in a further aspect, the invention relates to a 
kit comprising a first vial containing a first nucleic 
acid molecule capable of specifically hybridising to: (i) 
a gene encoding a transferase, or (ii) a gene encoding an 
5 enzyme for transport or processing oligosaccharide or 

polysaccharide, including a wzx or wzy gene, wherein the 
said gene is involved in the synthesis of a bacterial 
polysaccharide. The kit may also provide in the same or a 





WO 98/50531 



PCT/AU98/00315 



21 



separate vial a second specific nucleic acid capable of 
specifically hybridising ~ to : (i) a gene encoding a 
transferase, or (ii) a gene encoding an enzyme for 
transport or processing oligosaccharide or polysaccharide, 
5 including a wzx or wzy gene, wherein the said gene is 

involved in the synthesis of a bacterial polysaccharide, 
wherein the sequence of the second nucleic acid molecule 
is different from the sequence of the first nucleic acid 
molecule . 

10 In a further aspect the invention relates to a kit 

comprising a first vial containing a first nucleic acid 
molecule capable of specifically hybridising to: (i) a 
gene encoding a transferase, or (ii) a gene encoding an 
enzyme for transport or processing oligosaccharide or 

15 polysaccharide including wzx or wzy, wherein the said gene 
is involved in the synthesis of a bacterial O antigen. 
The kit may also provide in the same or a separate vial a 
second specific nucleic acid capable of specifically 
hybridising to: (i) a gene encoding a transferase ., or 

20 (ii) a gene encoding an enzyme for transport or processing 
oligosaccharide or polysaccharide including wzx or wzy, 
wherein the said gene is involved in the synthesis of O 
antigen, wherein the sequence of the second nucleic acid 
molecule is different from the sequence of the first 

25 nucleic acid molecule. Preferably the first and second 
nucleic acid sequences are derived from cojLi or the 
first and second nucleic acid sequences are derived from 
S. enterica. 



30 the 0157 gene cluster for the first time and recognise 

that from this sequence of this previously uncloned full 
gene cluster appropriate recombinant molecules can be 
generated and inserted for expression to provide expressed 
0157 antigens useful in applications such as vaccines. 

35 

DEFINITIONS 

The phrase, M a nucleic acid molecule derived from a 
gene" means that the nucleic acid molecule has a 



The present inventors provide full length sequence of 
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nucleotide sequence which, is either identical or 
substantially similar to "all or part of the identified 
gene. Thus a nucleic acid molecule derived from a gene 
can be a molecule which is isolated from the identified 
5 gene by physical separation from that gene, or a molecule 
which is artif icially ^ synthesis ed and has a nucleotide 
sequence which is either identical to or substantially 
similar to all or part of the identified gene. While some 
workers consider only the DNA strand wxth the same 
10 sequence as the mRNA transcribed from the gene, here 
either strand is intended. 

Transferase genes are regions of nucleic acid which 
have a nucleotide sequence which encodes gene products 
that transfer monomer ie siigar units. 
15 Flippase or : ^^es ! 'are regions of nucleic acid 

which have a nlidl:|5&txM A sequence which encodes a gene 
product that 'ieiij^ repeat units generally 

composed of fc^W 1!^^ sugar units to the 

external surface of me^rane . 

20 Polymerase or ^eri^is are regions of nucleic acid 

which have a nucleotide sequence which encodes gene 
products that polymerise repeating oligosaccharide units 
generally composed of 3^-6 monomer ic sugar units. 
The nucleotide ■ .s^i^a^^dftcfes , provided in this 
25 specification ^e" d^^rifcetf ; in- '"the sequence listing as 
anti-sense sequences t V This term is used in the same 
manner as it is ^ uSed" iih Gl6ssairy of Biochemistry and 
Molecular Biology>-R^i^^'.E^ilt.'ion f David M. Glick, 1997 
Portland Press Ltd., London on page 11 where the term is 
30 described as referring to one of the two strands of 
double-stranded DMA usually that which has the same 
sequence as the mRNA . We* use it to describe this strand 
which has the /,same>< s*e^^^'^a:fe the mRNA. 
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nomenclature: 

Synonyms for EL_ coli 0111 rfb 



£3 



Current names 


Out* Tir^Tnec; 


Bastin et al 19 


wbdH 


orfl 




gmd 


orf2 




wbdl 


orf3 


orf 3. 4* 


manC 


orf4 


rfbM* 


manB 


orf5 


rfbK* 


wbdJ 


orf 6 


orf 6 .7* 


wbdK 


orf7 


orf 7 .7* 


wzx 


orf 8 


orf 8. 9 and rfbX* 


wzy 


orf9 




wbdL 


orflO 




wbdM 


orfll 





10 



15 * Nomenclature according to Bastin D.A. , et al . 1991 "Molecular 
cloning and expression in Escherichia coli K-12 of the rfb gene 
cluster determining the O antigen of an 12*. coli Olll strain". Mol . 
Microbiol. 5:9 2223-2231. 



fij 


20 


Other 


Synonyms 








wzy 


rfc 








wzx 


rfbX 




yJ 




rmlA 


rfbA 






rmlB 


rfbB 






25 


rmlC 
rmlD 


rfbC 
rfbD 








gif 


orf 6* 




■ess? 




wbbl 


orf3#, 


orf8* of E . coli K-12 






wbbJ 


orf2#, 


orf 9* of E. coli K-12 


t.fl 


30 


wbbK 


orfl#, 


orf 10* of E. coli K-12 






wbbL 


orf5#, 


orf 11* of E. coli K-12 






# 


Nomenclature according to Yao, Z. And M. A. Valvano 1994. 






*Genetic analysis of the 


O-specific lipopoly saccharide biosynthesis 



region (rfb) of Eschericia coli K-12 W3110: identification of genes 
35 the confer groups - specif icty to Shigella flexineri serotypes Y and 
4a*. J. Bacterid. 176: 4133-4143. 

* Nomenclature according to Stevenson et al. 1994. ^Structure of 
the 0-antigen of E. coli K-12 and the sequence of its rfb gene 
cluster". J. Bacteriol 176: 4144-4156. 

40 • S . enterica is a name introduced in 1987 to replace the many other 
names such as Salmonella typhi and Salmonella typfrjmur ium , the old 
species names becoming serovar names as in £U enterica sv Typhi. 
However, the traditional names are still widely used. 

• The 0 antigen genes of many species were given rfb names ( rfbA etc) 
45 and the 0 antigen gene cluster was often referred to as the rfb 

cluster. There are now new names for the rfb genes as shown in the 
table. Both terminologies have been used herein, depending on the 
source of the information. 
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• BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 shows Eco R-l restriction maps of cosmid 
clones pPR!054, pPR1055, pPR1056, pPR1058, pPR1287 which 
are subclones of EL. coli 0111 O antigen gene cluster. The 
thickened line is the region common to all clones. Broken 
lines show segments that are non-contiguous on the 
chromosome. The deduced restriction map for coli 
strain M92 is shown above. 

Figure 2 shows a restriction mapping analysis of E . 
coli Olll 0 antigen gene cluster within the cosmid clone 
pPR1058. Restriction enzymes are: (B: BairiHl; Bg: JBgrlll, 
E: EcoRl; H: Hindlll; K: Kpnl ; P: Ps tl ; S: Sail and X: 
Xhol. Plasmids pPR1230, pPR1231, and pPR1288 are deletion 
derivatives of pPR1058 . Plasmids pPR 1237, pPR1238, 
PPR1239 and pPR1240 are in pUC19. Plasmids pPR1243, 
PPR1244, pPR1245, pPR1246 and pPR1248 are in pUC18, and 
PPR1292 is in pUC19 . Plasmid pPR1270 is in pT7T319U. 
Probes 1, 2 and 3 were isolated as internal fragments of 
PPR1246, pPR1243 and pPR1237 respectively. Dotted lines 
indicate that subclone DNA extends to the left of the map 
into attached vector . 

Figure 3 shows the structure of E^ coli Olll O 
antigen gene cluster. 

Figure 4 shows the structure of E^ coli 0157 O 
antigen gene cluster. 

Figure 5 shows the structure jS^ enterica locus 
encoding the serogroup C2 O antigen gene cluster. 

Figure 6 shows the structure £L_ enterica locus 
encoding the serogroup B O antigen gene cluster. 

Figure 7 shows the nucleotide sequence of the E^ coli 
0111 0 antigen gene cluster. Note: (1) The first and last 
three bases of a gene are underlined and of italic 
respectively.; (2) The region which was previously 
sequenced by Bastin and Reeves 1995 ^Sequence and anlysis 
of the O antigen gene (rfb) cluster of Eschericia coli 
olll" Gene 164: 17-23 is marked. 

Figure 8 shows the nucleotide sequence of the EL. coli 
0157 0 antigen gene cluster. Note: (1) The first and last 
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three bases of a gene (region) are underlined and of italic 
respectively (2) The region previously sequenced by Bilge 
et al. 1996 "Role of the Eschericia coli 0157-H7 O side 
chain in adherence and analysis of an rfb locus". Inf. and 
Immun 64:4795-4801 is marked. 

Figure 9 shows the nucleotide sequence of enterica 
serogroup C2 0 antigen gene cluster. Note: 
(1) The numbering is as in Brown et al . 1992. "Molecular 
analysis of the rfb gene cluster of Salmonella serovar 
muenchen (strain M67) : the genetic basis of the 
polymorphism between groups C2 and B" . Mol. Microbiol. 6: 
1385-1394(2) The first and last three bases of a gene are 
underlined and in italics respectively. (3) Only that part 
of the group C2 gene cluster, which differs from that of 
group B, was sequenced and is presented here. 

Figure 10 shows the nucleotide sequence of enterica 
serogroup B O antigen gene cluster Note: (1) The numbering is as 
in Jiang et al . 1991- "Structure and sequence of the rfb (O 
antigen) gene cluster of Salmonella serovar typhimurium (strain 
LT2)". Mol. Microbiol. 5: 695-713. The first gene in the O 
antigen gene cluster is rmlB which starts at base 4099. (2) The 
first and last three bases of a gene are underlined and in 
italics respectively - 



5 BEST METHOD FOR CARRYING OUT THE INVENTION 

Materials and Methods -part 1 

The experimental procedures for the isolation and 
characterisation of the coli Olll O antigen gene 
cluster (position 3,021-9,981) are according to Bastin 
0 D.A., et al. 1991 ^Molecular cloning and expression in 

Escherichia coli K-12 of the rfb gene cluster determining 
the O antigen of an IL_ coli Olll strain*. Mol. Microbiol. 
5:9 2223-2231 and Bastin D. A. and Reeves, P.R. 1995 
* Sequence and analysis of the O antigen gene (r£ b) cluster 
5 of Escherichia coli Olll". Gene 164: 17-23. 
A. Bacterial strains and growth media 

Bacteria were grown in Luria broth supplemented as 
required. 
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B. Cosmids and phage 

Cosmrds in the host strain x2819 were repackaged in 
vivo. Cells were grown in 250mL flasks containing 30mL of 
culture, with moderate shaking at 30°C to an optical 
density of 0.3 at 580 nm. The defective lambda prophage 
was induced by heating in a water bath at 45°C for 15min 
followed by an incubation at 37°C with vigorous shaking 
for 2hr. Cells were then lysed by the addition of 0.3mL 
chloroform and shaking for a further lOmin. Cell debris 
were removed from lmL of lysate by a 5min spin in a 
microcentrifuge, and the supernatant removed to a fresh 
microfuge tube. One drop of chloroform was added then 
shaken vigorously through the tube contents. 

C. DNA preparation 

Chromosomal DNA was prepared from bacteria grown 
overnight at 37°C in a volume of 3 0mlj of Luria broth. 
After harvesting by centrif ugation, cells were washed and 
resuspended in 10mL of 50mMTris-HCl pH 8.0. EDTA was 
added and the mixture incubated for 2 0min. Then lysozyme 
was added and incubation continued for a further lOmin. 
Proteinase K, SDS, and ribonuclease were then added and 
the mixture incubated for up to 2hr for lysis to occur. 
All incubations were at 37°C. The mixture was then heated 
to 65°C and extracted once with 8mL of phenol at the same 
temperature. The mixture was extracted once with 5mL of 
phenol/ chloroform/ iso-amyl alcohol at 4°C. Residual 
phenol was removed by two ether extractions. DNA was 
precipitated with 2 vols, of ethanol at 4°C, spooled and 
washed in 70% ethanol, resuspended in l-2mL of TE and 
dialysed. Plasmid and cosmid DNA was prepared by a 
modification of the Birnboim and Doly method [Birnboim, H. 
C. And Doly, J. (1979) A rapid alkaline extraction 
procedure for screening recombinant plasmid DNA Nucl. Acid 
Res. 7:1513-1523. The volume of culture was lOmL and the 
lysate was extracted with phenol /chlorof orm/iso-amyl 
alcohol before precipitation with isopropanol . Plasmid 
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DNA to be used as vector was isolated on a continuous 
caesium chloride gradient following alkaline lysis of 
cells grown in 1L of culture. 

D. Enzymes and buffers. 

Restriction endonucleases and DNA T4 ligase were 
purchased from Boehringer Mannheim (Castle Hill, NSW, 
Australia) or Pharmacia LKB (Melbourne, VIC Australia). 
Restriction enzymes were used in the recommended 
commercial buffer . 

E. Construction of a gene bank. 

Individual aliquots of M92 chromosomal DNA (strain 
Stoke W, from Statens Serum Institut, 5 Artillerivej , 2300 
Copenhagen S, Denmark) were partially digested with 0.2U 
Sau3Al for l-15mins. Aliquots giving the greatest 
proportion of fragments in the size range of approximately 
40-50kb were selected and ligated to vector pPR691 
previously digested with BairiHl and PvuII. Ligation 
mixtures were packaged in vitro with packaging extract. 
The host strain for transduction was x2819 and 
recombinants were selected with kanamycin. 

F . Serological procedures . 

Colonies were screened for the presence of the Olll 
antigen by immunoblotting . Colonies were grown overnight, 
up to 100 per plate then transferred to nitrocellulose 
discs and lysed with 0.5N HC1 . Tween 2 0 was added to TBS 
at 0.05% final concentration for blocking, incubating and 
washing steps. Primary antibody was E^. coli O group 111 
antiserum, diluted 1:800. The secondary antibody was goat 
anti-rabbit IgG labelled with horseradish peroxidase 
diluted 1:5000. The staining substrate was 4-chloro-l- 
napthol. Slide agglutination was performed according to 
the standard procedure. 

G. Recombinant DNA methods. 

Restriction mapping was based on a combination of 
standard methods including single and double digests and 
sub-cloning. Deletion derivatives of entire cosmids were 
produced as follows: aliquots of 1 . 8jig of cosmid DNA were 
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a 

fU 15 

Q 

vD 
UJ 

jLjj. 

b 20 

CI 



digested in a volume of 2 0p.l with 0.25U of restriction 
enzyme for~5-80min. One half of each aliquot was used to 
check the degree of digestion on an agarose gel. The 
sample which appeared to give a representative range of 
fragments was ligated at 4°C overnight and transformed by 
the CaCl 2 method into JM109. Selected plasmids were 
transformed into S0174 by the same method. P4657 was 
transformed with pPR1244 by electroporation . 
H. DNA hybridisation 

Probe DNA was extracted from agarose gels by 
electroelution and was nick- translated using [a-32P] -dCTP . 
Chromosomal or plasmid DNA was electrophoresed in 0.8% 
agarose and transferred to a nitrocellulose membrane. The 
hybridisation and pre-hybridisation buffers contained 
either 30% or 50% formamide for low and high stringency 
probing respectively. Incubation temperatures were 42°C 
and 37°C for pre-hybridisation and hybridisation 
respectively. Low stringency washing of filters consisted 
of 3 x 20min washes in 2 x SSC and 0.1% SDS. High- 
stringency washing consisted of 3 x 5min washes in 2 x SSC 
and 0.1% SDS at room temperature, a lhr wash in 1 x SSC 
and 0.1% SDS at 58°C and 15min wash in 0 . 1 x SSC and 0.1% 

SDS at 58°C. 

I. Nucleotide sequencing of coli Olll O antigen gene 
cluster (position 3,021-9,981) 



automated sequencer (CA, USA) . The region between map 
positions 3.3 0 and 7.90 was sequenced using 
uni-directional exonuclease III digestion of deletion 
families made in PT7T3190 from clones pPR1270 and pPR1272 . 
Gaps were filled largely by cloning of selected fragments 
into M13mpl8 or Ml3mpl9 . The region from map positions 
7.90-10.2 was sequenced from restriction fragments in 
M13mpl8 or M13mpl9 . Remaining gaps in both the regions 
were filled by priming from synthetic oligonucleotides 
complementary to determined positions along the sequence, 



Nucleotide sequencing was performed using an ABI 373 
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using a single stranded DNA template in M13 or phagemid. 
The oligonucleotides were designed after analysing the 
adjacent sequence. All sequencing was performed by the 
chain termination method. Sequences were aligned using SAP 
5 [Staden, R. , 1982 "Automation of the computer handling of 
gel reading data produced by v the shotgun method of DNA 
sequencing". Nuc. Acid Res. 10: 4731-4751; Staden. R. , 
1986 "The current status and portability of our sequence 
handling software". Nuc. Acid Res. 14: 217-231]. The 
10 program NIP [Staden, R. 1982 "An interactive graphics 

program for comparing and aligning nucleic acid and amino 
acid sequence". Nuc. Acid Res. 10: 2951-2961] was used to 
find open reading frames arid translate them into proteins. 
J. Isolation of clones-carrying EL_ £Ql± 0111 0 antigen 

15 gene cluster 

The I, coli O antigen gene cluster was isolated 
according to the methddVQ.f B$stin D.A. , et al. [1991 
•Molecular cloning and ^expr^s ion in Escherichia coli K-12 
of the rfb gene cluster' determining the O antigen of an E. 
20 coli Olll strain". Mol. Microbiol. 5(9), 2223-2231]. 

Cosmid gene banks of M92 chromosomal DNA were established 
in the in vivo packaging strain x2819. From the genomic 
bank, 3.3 x 10 3 colonies were screened with 1^ coJLi 0111 
antiserum using an immutio^blotting procedure : 5 colonies 
25 (pPR1054, pPR1055 , pPRlV056 ; pPR105 8 arid pPRl287) were 

positive. The cosmids froni 'these strains were packaged in 
vivo into lambda particles and transduced into the 1L_ soli 
deletion mutant S<|>174 which lacks all O antigen genes. In 
this host strain, all plasmxds gave positive agglutination 
30 with 0111 antiserum. An Bco Rl restriction map of the 5 
independent cosmids showed that they have a region of 
approximately 11.5 kb in common (Figure 1). Cosmid 
PPR10S8 included suff i'cletfti 'flanking DNA to identify 
several chromosomal «^rs -linked to 0 antigen gene 
35 cluster and was selected for analysis of the 0 antigen 
gene cluster region. 

K. Restriction mapping of cosmid pPR1058 
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Cosmid pPR1058 was mapped in two stages. A 
preliminary map was constxucted first, and then the region 
between map positions 0.00 and 23.10 was mapped in detail, 
since it was shown to be sufficient for Olll antigen 
5 expression. Restriction sites for both stages are shown 
in Figure 2, The region common to the five cosmid clones 
was between map positions 1.35 and 12.95 of pPR1058. 

To locate the O antigen gene cluster within pPR1058, 
PPR1058 cosmid was probed with DNA probes covering O 

10 antigen gene cluster flanking regions from SL_ enterica LT2 
and E. coli K-12 . Capsular polysaccharide (cps) genes lie 
upstream of O antigen gene cluster while the gluconate 
dehydrogenase (grid) gene and the histidine (his) operon 
are downstream, the latter being further from the O 

15 antigen gene cluster. The probes used were pPR472 

O.SBkb), carrying the gnd gene of LT2 , pPR685 (5.3kb) 
carrying two genes of the cps cluster, cpsB and cpsG of 
LT2, and K350 (16.5kb) carrying all of the his operon of 
K-12. Probes hybridised as follows: pPR472 hybridised to 

20 1.55kb and 3.5 kb (including 2.7 kb of vector) fragments 
of Pstl and Hindlll double digests of pPR!246 (a 
Hindlll/ScoRl subclone derived from pPR1058, Figure 2), 
which could be located at map positions 12.95-15.1; pPR685 
hybridised to a 4 . 4 kb EcoRl fragment of pPR1058 

25 (including 1.3 kb of vector) located at map position 0.00- 
3.05; and K350 hybridised with a 32kb EcoRl fragment of 
PPR1058 (including 4.0kb of vector), located at map 
position 17.30-45,90. Subclones containing the presumed 
gnd region complemented a gmd~edd" strain GB23152. On 

3 0 gluconate bromothymol blue plates, pPR1244 and pPRl292 in 
this host strain gave the green colonies expected of a 
grxTeddT genotype . The his* phenotype was restored by 
plasmid pPR1058 in the his deletion strain S<|>174 on 
minimal medium plates, showing that the plasmid carries 

3 5 the entire his operon. 

It is likely that the O antigen gene cluster region 
lies between gnd and cps, as in other E. coli and S . 
enterica strains, and hence between the approximate map 
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positions 3.05 and 12.95. To confirm this, deletion 
derivatives of pPR1058 were made as follows: first, 
PPR1058 was partially digested with Hindlll and self 
ligated. Transf ormants were selected for kanainycin 
resistance and screened for expression of Olll antigen. 
Two colonies gave a positive reaction. EcdRl digestion 
showed that the two colonies hosted identical plasmids, 
one of which was designated pPR123 0, with an insert which 
extended from map positions 0.00 to 23.10. Second pPR1058 
was digested with Sail and partially digested with Xhol 
and the compatible ends were re-ligated. Transf ormants 
were selected with kanamycin and screened for 0111 antigen 
expression. Plasmid DNA of 8 positively reacting clones 
was checked using EcoRl and Xhol digestion and appeared to 
be identical. The cosmid of one was designated pPR1231. 
The insert of pPR1231 contained the DNA region between map 
positions 0.00 and 15.10. Third, pPR1231 was partially 
digested with Xhol, self -ligated, and transf ormants 
selected on spectinomycin/ streptomycin plates . Clones 
0 were screened for kanamycin sensitivity and of 10 

selected, all had the DNA region from the Xhol site in the 
vector to the Xhol site at position 4.00 deleted. These 
clones did not express the Olll antigen, showing that the 
Xhol site at position 4.00 is within the O antigen gene 
5 cluster. One clone was selected and named pPR1288. 

Plasmids pPRl230, pPR1231, and pPR1288 are shown in Figure 
2. 

L . Analysis of the E. coli Olll O antigen gene 

cluster (position 3,021-9,981) nucleotide sequence data 

0 Bastin and Reeves [1995 "Sequence and analysis of the 

O antigen gene (rfb) cluster of Escherichia coli Olll". Gene 
164: 17-23] partially characterised the E_-_ coli Olll O 
antigen gene cluster by sequencing a fragment from map 
position 3,021-9,981. Figure 3 shows the gene 

5 organisation of position 3,021-9,981 of IL. coli Olll O 

antigen gene cluster. orf3 and orf6 have high level amino 
acid identity with wcaH and wcaG (46.3% and 37.2% 
respectively) , and are likely to be similar in function to 
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sugar biosynthetic pathway genes in the E^. coli K-12 
colanic gene cluster. orf4 and orf5 show high' levels of 
amino acid homology to manC and manB genes respectively. 
orf 7 shows high level homology with rfbH which is an 
abequose pathway gene . or £8 encodes a protein with 12 
transmembrane segments and has similarity in secondary 
structure to other w$x and is likely therefore to be 
the O antigen f lippase~geri$!,; v . 

Materials and MPt hods-p arf q> 

A. Nucleotide sequencing of 1 to 3,020 and 9,982 to 
14,516 of the E^ coli Olll O antigen gene cluster 

The sub clones whic^ contained novel nucleotide 
sequences, pPR123l (malposition 0 and 1,510), pPR1237 
(map position -300 to ? , 744 ) , pPR1239 (map position 2,744 
to 4,168), pPR1245 (malposition 9,736 to 12,007) and 
PPR1246 (k>V W^^2jj<i^o .15,300) . (Figure 2), were 
characterised as follows : :^e distal ends of the inserts 
of pPR1237, pPR1239 and^ppR1245 were sequenced using the 
M13 forward and , reyer.se primers located in the vector. 
PCR walking was carried .out to sequence further into each 
insert using primers based on the sequence data and the 
primers were tagged wi^th M13 forward or reverse primer 
sequences for sea^e^c^aag,, . , .-.This PpR walking procedure was 
repeated until t^;^h^^;^i^sj^t: ; was sequenced. pPR1246 
was characterised g&Q. &$tf&1#pn 12,007 to 14,516. The DNA 
of these sub clones was sequenced in both directions. The 
sequencing reactions were performed using the dideoxy 

anqcycling and reaction products 
were analysed using f.^o.^es.c.ent dye and an API automated 
sequencer (CA, USA) . 

B. Analysis of .the. .E_^ £$1± Olll O antigen gene cluster 
(positions 1 to|^0|^^2j; O 14,516 of SEQ ID NO:l) 



nucleotide s^^^u^ 



The gene , bse^^^^^i^. ^^ ;; -jfr^jari|M^«. of coli Olll 0 
antigen gene c ids iter, vpi-ch ; were not characterised by 
Bastin and Reeves [1995 "Sequence and analysis of the O 
antigen gene (rft>) cluster of Esch e n>hi a coli Olll." Gene 
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164: 17-23] , (positions 1 to 3,020 and 9,982 to 14,516) is 
shown in Figure 3. There are two open reading frames in 
region 1 . Four open reading frames are predicted in 
region 2. The position of each gene is listed in Table 5. 
5 The deduced amino acid sequence of orfl (wbdH) shares 

about 64% similarity with that of the rfp gene of Shigella 
dvsenteriae . Rfp and WbdH have very similar 
hydrophobicity plots and both have a very convincing 
predicted transmembrane segment in a corresponding 
10 position. rfp is a galactosyl transferase involved in the 
synthesis of LPS core, thus wbdH is likely to be a 
galactosyl transferase gene. orf2 has 85.7% identity at 
amino acid level to the gmd gene identified in the coli 
5 K-12 colanic acid gene cluster and is likely to be a gmd 

jjF 15 gene. orf9 encodes a protein with 10 predicted 
IT} transmembrane segments and a large cytoplasmic loop. 

O This inner membrane topology is a characteristic feature 

J?j of all known 0 antigen polymerases thus it is likely that 

s orf9 encodes an 0 antigen polymerase gene, wzy. orflO 

jUs. 

2 0 (wbdL) has a deduced amino acid sequence with low homology 
q with Lsi2 of Neisseria, gonorrhoeae . Lsi2 is responsible 

for adding GlcNAc to galactose in the synthesis of 
^ lipooligosaccharide. Thus it is likely that wbdL is 

either a colitose or glucose transferase gene. orfll 
25 {wbdM) shares high level nucleotide and amino acid 

similarity with TrsE of Yersinia enterocholitica . TrsE is 
a putative sugar transferase thus it is likely that wbdM 
encodes the colitose or glucose transferase. 

In summary three putative transferase genes and an 0 

3 0 antigen polymerase gene were identified at map position 1 

to 3,020 and 9,982 to 14,516 of E^ coli Olll 0 antigen 
gene cluster. A search of GenBank has shown that there 
are no genes with significant similarity at the nucleotide 
sequence level for two of the three putative transferase 
35 genes or the polymerase gene. SEQ ID NO:l and Figure 7 
provide the nucleotide sequence of the Olll antigen gene 
cluster. 
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Materials and Methods -part: 3 

A. PCR amplification of '0157 antigen gene cluster from 
an coli 0157:H7 strain (Strain C664-1992, from Statens 
Serum Institut, 5 Artillerivej , 2300, Copenhagen S, 
Denmark) 

E. coli 0157 O antigen gene cluster was amplified by 
using long PCR [Cheng et al . 1994, Effective amplification 
of long targets from cloned inserts and human and genomic 
DNA" P.N.A.S. USA 91: 5695-569] with one primer (primer 
) #412: att ggt age tgt aag cca agg gcg gta gcg t) based on 
the JumpStart sequence usually found in the promoter 
region of O antigen gene clusters [Hobbs, e t al . 1994 "The 
JumpStart sequence: a 39 bp element common to several 
polysaccharide gene dusted" Mol . Microbiol. 12: 855-856], 
5 and another primer #482 (cac tgc cat acc gac gac gec gat 
ctg ttg ctt gg) based on the gnd gene usually found 
downstream of the O antigen gene cluster. Long PCR was 
carried out using the Expand Long Template PCR System from 
Boehringer Mannheim (Castle Hill NSW Australia) , and 
0 products, 14 kb in length, from several reactions were 
combined and purified using the Pr omega Wizard PCR preps 
DNA purification System (Madison WI USA) . The PCR product 
was then extracted with phenol and twice with ether, 
precipitated with 70% ethanol, and resuspended in 40jlL of 
5 water - 

B. Construction of a random DNase 1 bank: 

Two aliquots containing about 150ng of DNA each were 
subjected to DNase I digestion using the Novagen DNase I 
Shotgun Cleavage (Madison WI USA) with a modified protocol 
\0 as described. Each aliquot was diluted into 45\ll of 0.05M 
Tris -HC1 (pH7.5), 0.05mg/mL BSA and lOmM MnCl 2 . 5^lL of 
1:3000 or 1:4500 dilution of DNasel (Novagen) (Madison WI 
USA) in the same buffer was added into each tube 
respectively and 10*11 of stop buffer (lOOraM EDTA) , 30% 

S5 glycerol, 0.5% Orange G, 0.075% xylene and cyanol 

(Novagen) <Madison WI USA) was added after incubation at 
15°C for 5 min. The DNA from the two DNasel reaction 
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tubes were then combined and fractionated on a 0 . 8% LMT 
agarose gel, and the gel segment with DNA of about lkb in 
size {about 1 . 5mlj agarose) was excised. DNA was extracted 
from agarose using Promega Wizard PGR Preps DNA 
5 Purification (Madison WI USA) and resuspended in 2 00 |XL 
water, before being extracted with phenol and twice with 
ether, and precipitated. The DNA was then resuspended in 
17.25 \LL water and subjected to T4 DNA polymerase repair 
and single dA tailing using the Novagen Single dA Tailing 
10 Kit (Madison WI USA) . The reaction product (85|ll 
containing about 8ng DNA) was then extracted with 
chloroform: isoamyl alcohol (24:1) once and ligated to 3x 
10" 3 pmol pGEM-T (Promega) (Madison WI USA) in a total 
volume of lOOfiL. Ligation was carried out overnight at 
15 4°C and the ligated DNA was precipitated and resuspended 
in 20|XL water before being electroporated into coli 
strain JM109 and plated out on BCIG-IPTG plates to give a 
bank . 

C . Sequencing 

2 0 DNA templates from clones of the bank were prepared 

for sequencing using the 96-well format plasmid DNA 
miniprep kit from Advanced Genetic Technologies Corp 
(Gaithersburg MD USA) The inserts of these clones were 
sequenced from one or both ends using the standard Ml 3 
25 sequencing primer sites located in the pGEM-T vector. 
Sequencing was carried out on an ABI377 automated 
sequencer (CA USA) as described above, after carrying out 
the sequencing reaction on an ABI Catalyst (CA USA) . 
Sequence gaps and areas of inadequate coverage were PCR 

3 0 amplified directly from 0157 chromosomal DNA using primers 

based on the already obtained sequencing data and 
sequenced using the standard M13 sequencing primer sites 
attached to the PCR primers . 

D. Analysis of the 2L_ coli 0157 O antigen gene cluster 
3 5 nucleotide sequence data 

Sequence data were processed and analysed using the 
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Staden programs [Staden, R. , 1982 ^Automation of the 
computer handling of gel "reading data produced by the 
shotgun method 1 of DNA sequencing." Nuc. Acid Res. 10: 
4731-4751; Staden, R. , 1986 "The current status and 
portability of our sequence handling software". Nuc. Acid 
Res. 14: 217-231; Staden, R. 1982 *An interactive graphics 
program for comparing and aligning nucleic acid and amino 
acid sequence''. Nuc. Acid Res. 10: 2951-2961]. Figure 4 
shows the structure of EL. coli 0157 0 antigen gene 
cluster. Twelve open reading frames were predicted from 
the sequence data, and the nucleotide and amino acid 
sequences of all these genes were then used to search the 
GenBank database for indication of possible function and 
specificity of these genes. The position of each gene is 
listed in Table 6. The nucleotide sequence is presented 
in SEQ ID NO: 2 and Figure 8. 

orfs 10 and 11 showed high level identity to manC and 
manB and were named manC and manB respectively. orfl 
showed 89% identity (at amino acid level) to the gmd gene 
0 of the EL coli colanic acid capsule gene cluster 

(Stevenson G. , K. et al . 1996 "Organisation of the 
Escherichia coli K-12 gene cluster responsible for 
production of the extracellular polysaccharide colanic 
acid*. J. Bacteriol. 178:4885-4893) and was named gmd. 
5 orf8 showed 79% and 69% identity (at amino acid level) 

respectively to wcslG of the EL coli colanic acid capsule 
gene cluster and to wbcJ (or f 14. 8) gene of the Yersinia 
enterocolitica 08 O antigen gene cluster (Zhang, L. et al . 
1997 ^Molecular and chemical characterization of the 
0 lipopolysaccharide O-antigen and its role in the virulence 
of L enterocolitica serotype 08".Mol. Microbiol. 23:63- 
76) . Colanic acid and the Yersinia 08 O antigen both 
contain fucose as does the 0157 O antigen. There are two 
enzymatic steps required for GDP-L-fucose synthesis from 
► 5 GDP-4-keto~6-deoxy-D-mannose, the product of the gmd gene 
product. However, it has been shown recently (Tonetti, M 
et al. 1996 Synthesis of GDP-L-fucose by the human FX 
protein J. Biol. Chem. 271:27274-27279) that the human FX 
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protein has "significant homology" with the wcaG gene 
(referred to as Yefb in that paper) , and that the FX 
protein carries out both reactions to convert GDP-4-keto- 
6-deoxy-D-mannose to GDP-L-f ucose . We believe that this 
makes a very strong case for or£8 carrying out these two 
steps and propose to name the gene fcl . In support of the 
one enzyme carrying out both functions is the observation 
that there are no genes other than manB, manC, gmd and fcl 
with similar levels of similarity between the three 
bacterial gene clusters for fucose containing structures. 

orfS is very similar to wbeE (rfbE) of Vibrio 
chplerae 01, which is thought to be the perosamine 
synthetase, which converts GDP-4-keto-6-deoxy-D-mannose to 
GDP-perosamine (Stroeher, U.H et al. 1995 «A putative 
15 pathway for perosamine biosynthesis is the first function 
encoded within the rfb region of Vibrio cholerae' Ol . Gene 
166: 33-42) . v^ cholerae Ol and E^. coli 0157 O antigens 
contain perosamine and N-acetyl -perosamine respectively. 
The V^ cholerae 01 manA, manB, gmd and wbeE genes are the 
only genes of the \L_ choleras ol gene cluster with 
significant similarity to genes of the £^ coli 0157 gene 
cluster and we believe that our observations both confirm 
the prediction made for the function of wbe of V. 
cholerae , and show that orf5 of the 0157 gene cluster 
25 encodes GDP-perosamine synthetase. orfS is therefore 

named per. orfS plus about lOObp of the upstream region 
(postion 4022-5308) was previously sequenced by Bilge, S.S. 
et al. [1996 "Role of the Escherichi a coli 0157-H7 O side 
chain in adherence and analysis of an rfb locus" . Infect . 
30 Immun. 64:4795-4801]. 

or £12 shows high level similarity to the conserved 
region of about 50 amino acids of various members of an 
acetyl transferase family (Lin, W. , et al . 1994 "Sequence 
analysis and molecular characterisation of genes required 
35 for the biosynthesis of type 1 capsular polysaccharide in 
Staphylpcoccup aureus ". J. Bateriol. 176: 7005-7016) and 
we believe it is the N-ac etyl trans f erase to convert GDP- 
perosamine to GDP-perNAc . orf!2 has been named wbdR. 
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The genes manB, manC t gmd, fcl, per and wbdR account 
for all of" the expected biosynthetic pathway genes of the 
0157 gene cluster. 

The remaining biosynthetic step(s) required are for 
synthesis of UDP-GalNAc from UDP-Glc . It has been 
proposed (Zhang, L. , et al. 1997 "Molecular and chemical 
characterisation of the lipopolysaccharide O-antigen and 
its role in the virulence of Yersinia enterocolitica 
serotype 08".Mol. Microbiol. 23:63-76) that in Yersinia 
enterocolitica UDP-GalNAc is synthesised from UDP-GlcNAc 
by a homologue of galactose epimerase (GalE) , for which 
there is a galE like gene in the Yersinia enterocolitica 
08 gene cluster. In the case of 0157 there is no galE 
homologue in the gene cluster and it is. not clear how UDP- 
GalNAc is synthesised. It is possible that the galactose 
epimerase encoded by the galE gene in the gral operon, can 
carry out conversion of UDP-GlcNAc to UDP-GalNAc in 
addition to conversion of UDP-Glc to UDP-Gal. There do 
not appear to be any gene(s) responsible for UDP-GalNAc 
synthesis in the 0157 gene cluster. 

orf4 shows similarity to many wzx genes and is named 
wzx and orf2 which shows similarity of secondary structure 
in the predicted protein to other wzy genes and is for 
that reason named wzy. 

The orfl, orf3 and orf6 gene products all have 
characteristics of transferases, and have been named wbdN, 
wbdO and whdP respectively. The 0157 O antigen has 4 
sugars and 4 transferases are expected. The first 
transferase to act would put a sugar phosphate onto 
undecaprenol phosphate. The two transferases known to 
perform this function, WbaP (RfbP) and WecA (Rfe) transfer 
galactose phosphate and N- acetyl -glucosamine phosphate 
respectively to undecaprenol phosphate. Neither of these 
sugars is present in the 0157 structure. 

Further, none of the presumptive transferases in the 
0157 gene cluster has the transmembrane segments found in 
WecA and WbaP which transfer a sugar phosphate to 
undecaprenol phosphate and expected for any protein which 
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transferred a sugar to undecaprenol phosphate which is 
embedded within the membrane . 

The WecA gene which transfers GlcNAc-P to 
undecaprenol phosphate is located in the Enterobactereal 
Common Antigen (6CA) g£he cluster and it functions in ECA 
synthesis in most and perhaps all IL. coli strains, and 
also in O antigen synthesis for those strains which have 
GlcNAc as the first sXibai: in the O unit. 

It appears that WecA acts as the transferase for 
addition of GalNAc-l-P to undecaprenol phosphate for the 
Yersinia enterocolitis 08 O antigen [Zhang et al.1997 
"Molecular and chemical characterisation of the 
lipopolysaccharide O antigen and its role in the virulence 
of Yersinia enter^^cyiitxca serotype 08" Mol - Microbiol. 
5 23: 63-76.] and perhkt>s does so here as the 0157 structure 
includes GalNAc . WecA has also been reported to add 
Glucose-l-P ptic^sphstfei^ ; 't^^lJmM^c^renol phosphate in |L_ coli 
08 and 09 strains/ aba^ an Alternative possibility for 
transfer of the first sugar to undecaprenol phosphate is 
0 WecA mediated tr4n^ei^' of ' glucose, as there is a glucose 
residue in the 0157 0 antigen. In either case the 
requisite number of transferase genes are present if 
GalNAc or Glc is tratxsf eriresd by WecA and the side chain 
Glc is transferee! ''^ ^'transferase outside of the O 
>5 antigen gene cl}u6t^^^ ' v 

orf9 shows high level 1 similarity (44% identity at 
amino acid level, same ierigtli) with wcstH gene of the IL. 
coli colanic acid ci&psiuie gene cluster. The function of 
this gene is unknown and we give orf9 the name wbdQ. 
30 The DNA between manii and wdbR has strong sequence 

similarity to one of the H-repeat units of 1L_ coli K12 . 
Both of the inverted repeat -sequences flanking this region 
are still recoc^is^le, each with two of the 11 bases 
being changed. '\ ^ associated protein encoding 

35 gene located Wl^i^t^ a 267 base deletion 

and mutations in various positions. It seems that the H- 
repeat unit has been associated with this gene cluster for 
a long period of time since it translocated to the gene 



WO 98/50531 




PCT/AU98/00315 



cluster, perhaps playing a role in assembly of the gene 
cluster as~has been proposed in other cases. 

Materials and Methods - part 4 

To test our hypothesis that O antigen genes for 
transferases and the wzx, wzy genes were more specific 
than pathway genes for diagnostic PCR, we first carried 
out PCR using primers for all the coli 016 O antigen 
genes (Table 4) . The PCR was then carried out using PCR 
primers for coli Olll transferase, wzx and wzy genes 
(Table 5, 5A) . PCR was also carried out using PCR primers 
for the E^ coli 0157 transferase, wzx and wzy genes (Table 
6 , 6A) . 

Chromosomal DNA from the 166 serotypes of coli 
available from Statens Serum Institut, 5 Artillerivej , 
23 00 Copenhagen Denmark was isolated using the Promega 
Genomic (Madison WI USA) isolation kit. Note that 164 of 
the serogroups are described by Ewing W. H. : Edwards and 
Ewings "Identification of the Enterobacteriacea" Elsevier, 
Amsterdam 1986 and that they are numbered 1-171 with 
numbers 31, 47, 67, 72, 93, 94 and 122 no longer valid. 
Of the two serogroup 19 strains we used 19ab strain F8188- 
41. Lior H. 1994 [ *Classif ication of Eschericia coli In 
Esch^rjcia coli in domestic animals and humans pp 31-72. 
Edited by C.L. Gyles CAB international] adds two more 
numbered 172 and 173 to give the 166 serogroups used. 
Pools containing 5 to 8 samples of DNA per pool were made. 
Pool numbers 1 to 19 (Table 1) were used in the coli 
0111 and 0157 assay. Pool numbers 20 to 28 were also used 
in the 0111 assay, and pool numbers 22 to 24 contained E. 
coli 0111 DNA and were used as positive controls (Table 
2) . Pool numbers 29 to 42 were also used in the 0157 
assay, and pool numbers 31 to 3 6 contained £. coli 0157 
DNA, and were used as positive controls (Table 3) . Pool 
numbers 2 to 20, 30, 43 and 44 were used in the £^ coli 
016 assay (Tables 1 to 3). Pool number 44 contained DNA 
of EL. coli K-12 strains C600 and WG1 and was used as a 
positive control as between them they have all of the E^ 



WO 98/50531 — PCT/AU98/00315 

41 

coli K-12 016 O antigen genes. 

PCR reactions were carried out under the following 
conditions: denaturing 94°C/30"; annealing, temperature 
varies (refer to Tables 4 to 8)/30"; extension, 72°C/1'; 
5 30 cycles. PCR reaction was carried out in an volume of 
25|JLL for each pool. After the PCR reaction, lOuL PCR 
product from each pool was run on an agarose gel to check 
for amplified DNA. 

Each IL_ coli and enterica chromosomal DNA sample 
10 was checked by gel electrophoresis for the presence of 

chromosomal DNA and by PCR amplification of the HL_ Sfili or 
s. enterica mdh gene using oligonucleotides based on 1L_ 
coli K-12 or salmonella enterica LT2 [Boyd et al. (1994) 
"Molecular genetic basis of allelic polymorphism in malate 
15 degydrogenase (mdh) in natural populations of Escherichia 
coli and SalmonelJa enterica " Proc . Nat. Acad. Sci. USA. 
91:1280-1284.] Chromosomal DNA samples from other 
bacteria were only checked by gel electrophoresis of 
chromosomal DNA . 

20 

A. Primers based on coli 016 0 antigen gene cluster 
sequence . 

The O antigen gene cluster of L. coli 016 was the 
only typical coli 0 antigen gene cluster that had been 

25 fully sequenced prior to that of Olll, and we chose it for 
testing our hypothesis. One pair of primers for each gene 
was tested against pools 2 to 20, 30 and 43 of soli 
chromosomal DNA. The primers, annealing temperatures and 
functional information for each gene are listed in Table 

30 4. 

For the five pathway genes, there were 17/21, 13/21, 
0/21, 0/21, 0/21 positive pools for rmlB, rmlD, rmlA, rmlC 
and glf respectively (Table 4) . For the wzx, wzy and 
three transferase genes there were no positives amongst 
35 the 21 pools of coli chromosomal DNA tested (Table 4) . 
In each case the #44 pool gave a positive result. 
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B. Primers based on the coli 0111 O antigen gene 
clsuter sequence. 

One to four pairs of primers for each of the 
transferase, wzx and wzy genes of Olll were tested against 
the pools 1 to 21 of E_u co^i chromosomal DNA (Table 5) . 
For wbdH, four pairs of primers, which bind to various 
regions of this gene, were tested and found to be specific 
for Olll as there Was 'nc> Cr euttpl if ied DNA of the correct size 
in any of those 21 pools of E^. coli chromosomal DNA 
tested. Three pairs of primers for wixitf were tested, and 
they are all specific although primers #985/#986 produced 
a band of the wrong size from one pool. Three pairs of 
primers for wzx were"t:esfc|d and they all were specific. 
Two pairs of primers wfer^'t felted for wzy, both are 
specific although #98b7#9»83 gave a band of the wrong size 
in all pools. Oii& paLllr of pirxmers for wbdL was tested and 
found unspecific ' ^^3^po^fe ' no further test was carried 
out. Thus, wzx, ^wci "6^' the three transferase 

genes are highly specif ic to Olll . Bands of the wrong 
0 size found in amplified DNA are assumed to be due to 

chance hybridisation of genes widely present in coli. 
The primers, annealing temperatures and positions for each 
gene are in (Table 5) . 

The 0111 assay viras ; also performed using pools 
5 including DNA f rbm 6 l^fe^^n 'expressing Yersinia 
pseudotuberculosis , Sftiq^iXfr bpycUj. and S^X^QPelj.a 
enterica strains (TaBle SAY . None of the oligonucleotides 
derived from wbdH, wzXr wzy or wbdM gave amplified DNA of 
the correct size witli £ties£ pools. Notably, pool number 
0 25 includes S. ehtfer ic AcLelaide which has the same O 

antigen as E. coli 0111: this pool did not give a positive 
PCR result for any primers tested indicating that these 
genes are highly specific for M E'- coli Olll. 

Each of tlie i l 2 to wbdH, wzx, wzy and 

5 wbdM produces a BSnci' ^i^'^^3tici€fed "size with the pools 

containing 0111 DNA' (p66i^ Q number 22 to 24) . As pools 22 
to 24 included DNA from all strains present in pool 21 
plus 0111 strain DNA (Table 2) , we conclude that the 12 
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pairs of primers all give a positive PCR test with each of 
three unrelated 0111 strains but not with any other 
strains tested. Thus these genes are highly specific for 
coli 0111. 

C. Primers based on the coli 0157 O antigen gene 
cluster sequence. 

Two or three primer pairs for each of the 
transferase, wzx and wzy genes of 0157 were tested against 
IL. coli chromosomal DNA of pools 1 to 19, 29 and 30 (Table 
6) . For wbdN, three pairs of primers, which bind to 
various regions of this gene, were tested and found to be 
specific for 0157 as there was no amplified DNA in any of 
those 21 pools of EL. coli chromosomal DNA tested. Three 
pairs of primers for wbdO were tested, and they are all 
specific although primers # 1211/#1212 produced two or 
three bands of the wrong size from all pools . Three pairs 
of primers were tested for wbdP and they all were 
specific. Two pairs of primers were tested for wbdR and 
they were all specific. For wzy, three pairs of primers 
were tested and all were specific although primer pair 
#12 03/#1204 produced one or three bands of the wrong size 
in each pool. For wzx, two pairs of primers were tested 
and both were specific although primer pair #1217/#1218 
produced 2 bands of wrong size in 2 pools, and 1 band of 
wrong size in 7 pools. Bands of the wrong size found in 
amplified DNA are assumed to be due to chance 
hybridisation of genes widely present in coli . The 
primers, annealing temperatures and function information 
for each gene are in Table 6 . 

The 0157 assay was also performed using pools 37 to 
42, including DNA from O antigen expressing Yersinia 
pseudotubeyculpsis , Shigella bovdii . Yersinia 
enterocolitica 09, Brucella abortus and Salmonella 
enteyjga strains (Table 6A) . None of the oligonucleotides 
derived from wbdN, wzy, wbdO, wzx, wbdP or wbdR reacted 
specifically with these pools, except that primer pair 
#1203/#1204 produced two bands with Y^ enterocolitica 09 
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and one of the bands is of the same size with that from 
the positive control. Primer pair #12 03/ #12 04 binds to 
wzy. The predicted secondary structures of Wzy proteins 
are generally similar, although there is very low 
similarity at amino acid or DNA level among the sequenced 
wzy genes. Thus, it is possible that L. enterolcolitica 
09 has a wzy gene closely related to that of JL_ coli 0157 . 
It is also possible that this band is due to chance 
hybridization of another gene, as the other two wzy primer 
pairs (#1205/#1206 and #1207/#1208) did not produce any 
band with L. enterocolitica 09. Notably, pool number 37 
includes enterica Landau which has the same O antigen 
as EL. coli 0157, and pool 38 and 39 contain DNA of B. 
abortus and Y^. enterocolitica 09 which cross react 
serologically with EL. coli 0157 . This result indicates 
that these genes are highly 0157 specific, although one 
primer pair may have cross reacted with enterocolitica 
09. 

Each of the 16 pairs binding to wbdN, wzx, wzy, wbdO, 
0 wbdP and wbdR produces a band of predicted size with the 
pools containing 0157 DNA (pools number 31 to 36) . As 
pool 29 included DNA from all strains present in pools 31 
to 36 other than 0157 strain DNA (Table 3), we conclude 
that the 16 pairs of primers all give a positive PCR test 
5 with each of the five unrelated 0157 strains. 

Thus PCR using primers based on genes wbdN, wzy, 
wbdO, wzx, wbdP and wbdR is highly specific for coli 
0157, giving positive results with each of six unrelated 
0157 strains while only one primer pair gave a band of the 
0 expected size with one of three strains with O antigens 
known to cross-react serologically with coli 0157 . 

D. Primers based on the Salmonella enterica serotype C2 
and B O antigen gene cluster sequences . 
5 We also performed a PCR using primers for the S_^_ 

enterica C2 and B serogroup transferases, wzx, wzy and 
genes (Tables 7 to 9) . The nucleotide sequences of C2 
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and B O antigen gene clusters are listed as SEQ ID NO: 3 
(Fig. 9) arid SEQ ID NO: 4 (Fig. 10 ) respectively . 
Chromosomal DNA from all the 46 serotypes of Salmonella 
enterica (Table 9) was isolated using the Promega Genomic 
isolation kit # 7 pools of 4 to 8 samples per pool were 
made. Salmonella enterica serotype B or C2 DNA was 
omitted from the pool for testing primers of 46 respective 
serotypes but added to a pool containing 6 other samples 
to give pool number 8 for use as a positive control. 

PCR reactions were carried out under the following 
conditions: denaturing, 94°C/30"; annealing, temperature 
varies (see below) /30"; extension, 72°C/1'; 30 cycles. 
PCR reaction was carried out in a volume of 25pL for each 
pool. After the PCR reaction, lOjiL PCR product from each 
pool was run on an agarose gel to check for amplified DNA. 
For pools which gave a band of correct size, PCR was 
repeated using individual chromosomal samples of that 
pool, and agarose gel was run to check for amplified DNA 
from each sample . 
0 The Salmonella enterica serotype B O antigen gene 

cluster (of strain LT2) was the first 0 antigen gene 
cluster to be fully sequenced, and the function of each 
gene has been identified experimentally [Jiang, X. M. , 
Neal , B . , Santiago , F . , Lee , S . J . , Romana , L . K . , and 
5 Reeves, P. R. (1991) "Structure and sequence of the rfb (O 
antigen) gene cluster of Salmonella serovar typhimurium 
(strain LT2)." Mol . Microbiol. 5(3), 695-713; Liu, D. , 
Cole, R., and Reeves, P. R. (1996). *An O antigen 
processing function for Wzx(RfbX) : a promising candidate 
30 for O-unit flippase" J. Bacterid., 178 (7 ), 2102-2107 ; Liu, 
D., Haase, A. M. , Lindqvist, L., Lindberg, A. A., and 
Reeves, P. R. (1993) . w Glycosyl transferases of O-antigen 
biosynthesis in S. enterica : identification and 
characterisation of transferase genes of groups B, C2 and 
35 El.* J. Bacteriol., 175, 3408-3413; Liu, D. , Lindquist, 
Li . , and Reeves P. R. (1995). transferases of O-antigen 
biosynthesis in Salmonella enterica: dideoxhexosyl 
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transferases of groups B and C2 and acetyl transferase of 
group C2 . " J . Bacteriol./ 177, 4084-4088; Romana, L. K., 
Santiago, F. S-, and Reeves, P. R. (1991). "High level 
expression and purification dThymidine-diphospho-D-glucose 
4,6 dehydratase (rfbB) from Salmonella, serovar typhimurium 
LT2 . " BBRC, 174, 846-852], One pair of primers for each 
of the pathway genes and wbaP was tested against the pools 
of Salmonella enterica DNA, two to three pairs of primers 
for each of the other transferases and wzx genes were also 
tested. See Table 8 for a list of primers and functional 
information of each gene, as well as the annealing 
temperature of the PGR reaction for each pair of primers . 

For pathway genes of group B strain LT2 , there are 
19/45, 14/45, 15/45, 12/45, 6/45, 6/45, 6/45, 6/45, 1/45, 
9/45, 8/45 positives for rmlB, rmlD, rmlA, rmlC, ddhD, 
ddhA, ddhB, ddhC, aJbe, manC, and manB repsectively (Table 
9) . 

For the LT2 wzx gene we used three primer pairs each 
of which gave 1/45 positive. For the 4 transferase genes 
0 we used a total of 9 primer pairs . 2 primer pairs for 
wbaV gave 2/90 positives. For 3 primer pairs of wbaN, 
11/13 5 gave a positive result. For the wbaP primer pair 
10/45 gave a positive result (Table 9) . 

The experimental data show that oligonucleotides 
5 derived from the wzx and wbaV group B O antigen genes are 
specific for group B 0 antigen amongst all 45 Salmonella 
enterica O antigen groups except O group 67 . The 
oligonucleotides derived from Salmonella enterica B group 
wbaN and wbaU genes detected B group O antigen and also 
0 produced positive results with groups A, Dl and D3 . WbaU 
encodes a transferase for a Mannose a (1-4) Mannose linkage 
and is expressed in groups A, B and Dl while wbaN, which 
encodes a transferase for Rhamnose a (1-3) Galactose 
linkage is present in groups A, B, Dl, D2, D3 and El. 
5 This accounts for the positive results with the group B 

wbaU and wbaN genes . The wbaN gene of groups E and D2 has 
considerable sequence differences from that of groups A, 
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B, Dl and D3 and this accounts for the positive results 
only with groups B, Dl ana D3 . 

The Salmonella enterica B primers derived from wzx 
and transferase genes produced a positive result with 
Salmone lla enterica 067. We find that Salmonella enterica 
067 has all the genes of the group B O antigen cluster. 
There are several possible explanations for this finding 
including the possibility that the gene cluster is not 
functional due to mutation and the group 067 antigenicity 
is due to another antigen, or the 0 antigen is modified 
after synthesis such that its antigenicity is changed. 
Salmonella enterica 067 would therefore be scored as 
Salmonella enterica group B in the PCR diagnostic assay. 
However, this is of little importance because Salmonella 
enterica 067 is a rare O antigen and only one (serovar 
Crossness) of the 2324 known serovars has the 067 
serotype [Popoff M.Y. et al (1992) "Antigenic formulas of 
the Salmonella enterica serovars" 6th revision WHO 
Collaborating Centre for Reference and Research on 
Salmonella enterica , Institut Pasteur Paris France] , and 
serovar Crossness had only been isolated once [M. Popoff, 
personal communication] . 

The Salmonella enterica B primers derived from wbaP 
reacted with group A, C2 , Dl, D2 , D3 , El, 54, 55, 67 and 
E4 O antigen groups. WbaP encodes the galactosyl 
transferase which initiates 0 unit synthesis by transfer 
of Galactose phosphate to the lipid carrier Undecaprenol 
phosphate. This reaction is common to the synthesis of 
several O antigens . As such wbaP is distinguished from 
other transf erases of the invention as it does not make a 
linkage within an 0 antigen. 

We also tested 20 primer pairs for the wzx, wzy and 5 
transferase genes of serotype C2 and found no positives in 
all the 7 pools (Table 7) . 

Groups A, B, Dl, D2 , D3 , C2 and El share many genes 
in common. Some of these genes occur with more than one 
sequence in which case each specific sequence can be named 
after one of the serogroups in which it occurs. The 
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distribution of these sequence specificities is shown in 
Table 10. The inventors have aligned the nucleotide 
sequences of Salmonella enteri wzy, wzx genes and 
transferase genes so as to determine specific combinations 
of nucleic acid molecules which can be employed to 
specifically detect and identify the Salmonel 1 a enteri r» 
groups A, B, Dl, D2 , D3 , C2 and El (Table 10). The 
results show that many of the O antigen groups can be 
detected and identified using a single specific nucleic 
acid molecule although other groups in particular D2 and 
El, and A and Dl require a panel of nucleic acid molecules 
derived from a combination of genes. 

It will be understood that in carrying out the 
methods of the invention with respect to the testing of 
particular sample types including samples from food, 
patients and faeces the samples are prepared by routine 
techniques routinely used in the preparation of such 
samples for DNA based testing. 
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TABLE 1 




Pool 


Strains of which chromosonal DNA included in the pool 


Source* 


No. 




1 


£. coli type strains for O serotypes 1, 2, 3, 4, 10, lt>, 18 ana 


IMVS a 


2 


_ ^ JliiiX-JL 1 'iin >ti AQ AO *71 RR and 100 

E. coli type strains for O serotypes 40, 41, 48, 49, /i, o, oo anu iw 


IMVS 


3 


. im inn i in ion 191 176 And 

E. coli type strains for O serotypes 1 102, 109, 119, l^o* l^i, J^- 5 * 


IMVS 




137 




4 


E. coli type strains for O serotypes 138, 139, 149, 7, 5, 6, 11 and 12 


IMVS 


5 


E. coli type strains for O serotypes 13, 14, 15, 17, 19ab, 20, 21 and 22 


IMVS 




E, coli type strains for O serotypes 23, 24, 25, 26, 27, 28, 29 and 30 


IMVS 


7 


E. coli type strains for O serotypes 32, 33, 34, 35, 36, 37, 38 and 42 


IMVS 


g 


E. coli type strains for O serotypes 43, 44, 45, 46, 50, 51, 52 and 53 


IMVS 


9 


E. coli type strains for O serotypes/54, 55, 56, 57, 58, 59, 60 and 61 


IMVS 


10 


E. coli type strains for O serotype? 62, 63, 64, 65, 66, 68, 69 and 70 


IMVS 


1 1 
1 1 


E . coli type strains f6r-O l siiot^ : %r7S;76 f 77, 78, 79, 80 and 81 


IMVS 


12 


E, coli type strains for O serotypes 82^ 83, 84, 85, 86, 87, 89 ana yu 


IMVS 


13 


£. a>« type strains for O serotypes 91, 92, 95, 96, 97, 98, 99 and 101 


IMVC 


14 


E. coli type strains for O serotypes 103, 104, 105, 106, 107, 108 and 110 


IMVS 


i * 


E coli tvoe strains for O serotypes 112, 162, 113, 114, 115, 116, 117 and 


IMVS 




118 , 




io 


F co/z tvoe strains for O serotypes 123, 165, 166, 167, 168, 169, 170 and 


See b 








17 


E. coli type strains for O serotypes 172, 173, 127, 128, 129, 130, 131 and 


See c 




132 




18 


E. coli type strains for O serotypes 133, 134, 135, 136, 140, 141, 142 and 


IMVS 




143 




19 


E. coU type strains for O serotypes 144, 145. 146, 147, 148, 150, 151 and 


IMVS 



152 



a. 
b. 



c. 



Institute of Medical and Veterinary Science, Adelaide, Australia 

123 from IMVS; the rest from Statens Serum Institut, Copenhagen, Denmark 

172 and 173 from Statens Serum Institut, Copenhagen, Denmark, the rest from IMVS 
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TABLE 2 



Pool 
No. 


Strains of which chromosonal DNA included in the pool 


Source* 


20 


2s. C0« type strains tor U serotypes im, i->4, io:>, ioo, ij/, , 1*?^ <mu 
160 


IMVS 


21 


E. coli type strains for O serotypes 161, 163, 164, 8, 9 and 124 


IMVS 


22 


As pool #21, plus E. coli 0111 type strain Stoke W. 


IMVS 


23 


As pool #21, plus E. coli 0111:H2 strain C1250-1991 


See d 


24 


As pool #21, plus E. coli 0111:H12 strain C156-1989 


See e 


25 


As pool #21, plus S. enterica serovar Adelaide 


See f 


26 


Y. pseudotuberculosis strains of O groups IA, IIA, IB, IIC, III, IV A, IVB, 
VA, VB, VI and VH 


See g 


27 


S. boydii strains of serogroups 1, 3, 4, 5, 6, 8, 9, 10, 11, 12, 14 and 15 


See h 


28 


S. enterica strains of serovars (each representing a different O group) Typhi, 
Montevideo, Ferruch, Jangwani, Raus, Hvittingfoss, Waycross, Dan, 
Dugbe, Basel, 65,:i:e,n,z,15 and 52:d:e,n,x,zl5 


IMVS 



d. C1250-1991 from Statens Serum Institut, Copenhagen, Denmark 

e. C156-1989 from Statens Serum Institut, Copenhagen, Denmark 

f . S. enterica serovar Adelaide from IMVS 

g. Dr S Aleksic of Institute of Hygiene, Germany 

h. Dr J Lefebvre of Bacterial Identification Section, Laboratoroie de Sante Publique du 
Qufebec, Canada 
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Pool 
No. 



Strains of which chromosonal DNA included in the pool 



Source* 



29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 

41 
42 

43 
44 



E. coli type strains for O serotypes 153, 154, 155, 156, 158, 159 and 160 

E. coli type strains for O serotypes 161, 163, 164, 8, 9, 111 and 124 

As pool #29, plus E. coli 0157 type strain A2 (Ol57:H19) 

As pool #29, plus E. coli 0157:H16 strain C475-89 

As pool #29, plus E. coli 0157:H45 strain C727-89 

As pool #29, plus E. coli 0157:H2 strain C252-94 

As pool #29, plus E. coli 0157:H39 strain C258-94 

As pool #29, plus E. coli 0157:H26 

As pool #29, plus S. enterica serovar Landau 

As pool #29, plus Brucella abortus 

As pool #29, plus Y. enterocolitica 09 

Y. pseudotuberculosis strains of O groups IA, HA, IIB, IIC, in, IV A, IVB VA 
VB, VI and VII 

S. boydii strains of serogroups 1, 3, 4, 5, 6, 8, 9, 10, 11, 12, 14 and 15 

S. enterica strains of serovars (each representing a different O group) Typhi, 
Montevideo, Ferruch, Jangwani, Raus, Hvittingfoss, Waycross, Dan, Dugbe 
Basel, 65:i:e,n,zl5 and 52:d:e,n,x,zl5 * 

E. coli type strains for O serotypes 1,2,3,4,10,18 and 29 

As pool #43, plus E. coli K-12 strains C600 and WG1 



IMVS 

IMVS 

IMVS 

See d 

See d 

See d 

See d 

See e 

See f 

Seeg 
See h 

See i 

Seej 
IMVS 

IMVS 

IVMS 
See k 



d. 
e. 
f. 

g- 
h. 

i. 
J. 



0157 strains from Statens Serum Institut, Copenhagen, Denmark 

0157:H26 from Dr R Brown of Royal Children's Hospital, Melbourne, Victoria 

S. enterica serovar Landau from Dr M Poppoff of Institut Pasteur, Paris, France 

B. Abortus from the culture collection of The University of Sydney, Sydney, Australia 

Y. enterocolitica 09 from Dr. K. Bettelheim of Victorian Infectious Diseases Reference 

Laboratory Victoria, Australia. 

Dr S Aleksic of Institute of Hygiene, Germany 

Dr J Lefebvre of Bacterial Identification Section, Laboratoroie de Sante Publique du 
Quebec, Canada 

Strains C600 and WG1 from Dr. B.J. Backmann of Department of Biology, Yale 
University, USA. 
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